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ASPERGILLUS ARABINOFURANOSIDASE 



The present invention relates to an enzyme. In addition, the present invention relates to 
a nucleotide sequence coding for the enzyme. Also, the present invention relates to a 
promoter, wherein the promoter can be used to control the expression of the nucleotide 
sequence coding for the enzyme. 

In particular, the enzyme of the present invention is an arabinofuranosidase enzyme 
having arabinoxylan degrading activity. 

It is known that it is desirable to direct expression of a gene of interest ("GOI") in certain 
tissues of an organism - such as a filamentous fungus (such as Aspergillus Niger) oreven 
a plant crop. The resultant protein or enzyme may be useful for the organism itself. For 
example, it may be desirable to produce crop protein products with an optimised amino 
acid composition and so increase the nutritive value of a crop. For example, the crop 
may be made more useful as a feed. 

In the alternative, it may be desirable to isolate the resultant protein or enzyme and then 
use the protein or enzyme to prepare, for example, food compositions. In this regard, 
the resultant protein or enzyme can be a component of the food composition or it can be 
used to prepare food compositions, including altering the characteristics or appearance 
of food compositions. It may even be desirable to use the organism, such as a 
filamentous fungus or a crop plant, to express non-plant genes, such as for the same 
purposes. 

Also, it may be desirable to use an organism, such as a filamentous fungus or a crop 
plant, to express mammalian genes. Examples of the latter products include interferons, 
insulin, blood factors and plasminogen activators. It is also desirable to use micro- 
organisms, such as filamentous fungi, to prepare products from GOIs by use of promoters 
that are active in the micro-organisms. 
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Fruit and vegetable ceil walls largely consist of polysaccharide, the major components 
being pectin, cellulose and xyloglucan (R.R. Selvendran and J. A. Robertson. IFR Report 
1989). Numerous cell wall models have been proposed which attempt to incorporate the 
essential properties of strength and flexibility (P. Albersheim, Sci. Am. 232. 81-95. 
1975; P. Albersheim, Plant Biochem. 3rd Edition (Bonner and Varner). Ac. Press. 1976: 
T. Hayashi. Ann. Rev. Plant Physiol. & Plant Mol. Biol., 40, 139-168, 1989). 

The composition of the plant cell wall is complex and variable. Polysaccharides are 
mainly found in the form of long chains of cellulose (the main structural component of 
the plant cell wall), hemicellulose (comprising various B-xylan chains) and pectic 
substances (consisting of galacturonans and rhamnogalacturonans; arabinans; and 
galactans and arabinogalactans). From the standpoint of the food industry, the pectic 
substances, arabinans in particular, have become one of the most important constituents 
of plant cell walls (Whitaker, J.R. (1984) Enzyme Microb. Tectinol., 6,341). 

One form of plant polysaccharide is arabinan. A review of arabinans may be found in 
EP-A-0506190. According to this document, arabinans consist of a main chain of a- 
(1-5) groups linked to one another. Side chains are linked a-(l-3) or sometimes a- 
(1-2) to the main a-(l-5)-L-arabinan backbone. In apple, for example, one third of the 
total arabinose is present in the side chains. The molecular weight of arabinan is 
normally about 15 kDa. 

Arabinans are degraded by enzymes collectively called arabinases. In this regard, 
arabinan-degrading activity is the ability of an enzyme to release arabinose residues, 
either monomers or oligomers, from arabinan backbones or from arabinan-containing side 
chains of other hemicellulose backbone structures such as arabinogalactans, or even the 
release of arabinose monomers via the cleavage of the 1-6 linkage between the terminal 
arabinofuranosyl unit and the intermediate glucosyl unit of monoterpenyl a-L- 
arabinofuranosyl glucosides. 



WO 96/29416 PC77EP96/01009 

3 

The activity of the arabinan degrading enzymes of EP-A-0506190 include: a) the ability 
to cleave (l-2)-a-L-arabinosidic linkages; b) the ability to cleave (l-3)-a-L-arabinosidic 
linkages; c) the ability to cleave (l-*5)-o;-L-arabinosidic linkages; d) the ability to cleave 
the 1-6 linkage between the terminal arabinofuranosyl unit and the intermediate glucosyl 
unit of monoterpenyl a-L-arabinofuranosyl elucosides. 

Arabinan-degrading enzymes are known to be produced by a variety of plants and 
microorganisms, among these, fungi such as those of the genera Aspergillus. Conicium, 
Rhodotorula (Kaji, A. (1984) Adv. Carbohydr. Chem. Biochem., 42, 383), Dichotomies 
(Brillouet et al. (1985) Carbohydrate Research, 144, 113), Ascomycetes and 
Basidomycetes (Sydow, G. (1977) DDR Patent Application No. 124.812). 

Another plant polysaccharide is xylan, whose major monosaccharide unit is xylose. 
Xylans are abundant components of the hemicelluloses. In monocoryledonous plants the 
dominant hemicellulose is an arabinoxylan, in which arabinose side chains are attached 
to a backbone of xylose residues. 

Arabinoxylans are carbohydrates found in the cell wall of cereals. A review of 
arabinoxylans and the enzymatic degradation thereof may be found in Voragen et at 
(1992 Characterisation of Cereal Arabinoxylans, Xylans and Xylanases pages 51-67, 
edited by J. Visser published by Elsevier Science Publishers). 

Typically, arabinoxylans comprise a xylose backbone linked together via 0-1,4- bonds. 
The xylose backbone is substituted with L-arabinose residues which are linked via a-\ 
bonds to the 2 or 3 position of the xylose residues. The xylose residues can be single or 
double substituted. In addition to substitution with arabinose the xylose residues can be 
substituted with acetyl groups, glucuronic acid and various other carbohydrates. The 
arabinose residues can be further substituted with phenolic acids such as ferulic acid and 
coumaric acid. The degree and kind of substitution depends on the source of the 
particular arabinoxylan. 
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Arabinoxylans are found in cereal cell wall where they are pan of the secondary cell 
wall. Arabinoxylans form about 3 % of wheat flour - part of it is water soluble (WSP). 
pan of it is water insoluble (WIP). 

Despite the fact that the arabinoxylans amount to only about 3 % of wheat the imponance 
of the arabinoxylan fraction is much higher. This is because the arabinoxylans of cereals ■ 
act as hydrocolloids, as they form a gel like structure with water. For example, the 
arabinoxylans of wheat flour bind up to 30% of the water in a dough despite the fact that 
they amount to only 3 % of the dry matter. When arabinoxylans bind water they 
increase the viscosity of the ground cereals and to such an extent that the cereals can 
become difficult to manage. 

The Theological properties of several systems where ground cereals are used can be 
manipulated using enzymes that degrade arabinoxylans. In modern bakery it is 
advantageous to reduce the viscosity of the dough in order to reduce the energy needed 
to process the doughs and also to get a higher volume of the bread. This is usually 
achieved by using enzymes that can degrade the xylose backbone of arabinoxylans. 

Enzymes that only cleave the arabinose side chains from the xylan backbone of 
arabinoxylan are, for the purposes of this application, collectively called arabinoxylan 
degrading enzymes. 

In feeds based on cereals, arabinoxylans in the cereals can increase the viscosity of the 
fluids in the intestines of the animals after the feeds have been ingested. This is a 
problem as it causes discomfort, such as indigestion, to the animals. Also, the nutritive 
value of the feeds is reduced. These problems can be avoided by addition of enzymes 
that degrade the arabinoxylan (such as xylanases) to the feed to avoid indigestion and to 
increase the nutritive value of the feed. However, some enzymes that degrade the 
arabinoxylans (especially some of the xylanases) require the presence of unsubstiruted 
backbones and so their activity can be limited. 
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Further discussions on arabinoxylans can be found in Xylans and Xylanases (1992. edited 
by J. Visser published by Elsevier Science Publishers). 



An arbinoxylan degrading enzyme is (l,4)-6-D-arabinoxylan arabinofuranohydrolase 
(AXH), as described by Kormelink et al 1991 (Kormelink, F.J.M., Searle-Van Leeuwen 
M.J.F., Wood. T.M., Voragen, A. G. J. (1991) Purification and characterization of a 
(l,4)-/3-D-arabinoxy lan arabinofuranohydrolase from Aspergillus awamori. Appl. Micro- 
biol. Biotechnol. 25:753-758). However, this document provides no sequence data for 
the enzyme or the nucleotide sequence coding for same or for the promoter for the same. 

Clearly, it would be useful to be able to degrade arabinoxylans, preferably by use of 
recombinant DNA techniques. 

The present invention seeks to provide an enzyme having arabinoxylan degrading activity; 
preferably wherein the enzyme can be prepared in certain or specific cells or tissues, such 
as in just a specific cell or tissue, of an organism, typically a filamentous fungus, 
preferably of the genus Aspergillus, such as Aspergillus niger, or even a plant. 

Also, the present invention seeks to provide a GOI coding for the enzyme that can be 
expressed preferably in specific cells or tissues, such as in certain or specific cells or 
tissues, of an organism, typically a filamentous fungus, preferably of the genus 
Aspergillus, such as Aspergillus niger. or even a plant. 

In addition, the present invention seeks to provide a promoter that is capable of directing 
expression of a GOI, such as a nucleotide sequence coding for the enzyme according to 
the present invention, preferably in certain specific cells or tissues, such as in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, such as Aspergillus niger, or even a plant. Preferably, the promoter 
is used in Aspergillus wherein the product encoded by the GOI is excreted from the host 
organism into the surrounding medium. 
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Furthermore, the present invention seeks to provide constructs, vectors, plasmids, cells, 
tissues, organs and organisms comprising the GOI and/or the promoter, and methods of 
expressing the same, preferably in specific cells or tissues, such as expression in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, or even a plant. 

According to a first aspect of the present invention there is provided an enzyme 
obtainable from Aspergillus, wherein the enzyme has the following characteristics: a MW 
of 33,270 D ± 50 D; a pi value of about 3.7; arabinoxylan degrading activity; a pH 
optima of from about 2.5 to about 7.0 (more especially from about 3.3 to about 4.6. 
more especially about 4); a temperature optima of from about 40°C to about 60°C (more 
especially from about 45°C to about 55°C, more especially about 50°C); and wherein the 
enzyme is capable of cleaving arabinose from the xylose backbone of an arabinoxylan. 

According to a second aspect of the present invention there is provided an enzyme having 
the sequence shown as SEQ. I.D. No. 1 or a variant, homologue or fragment thereof. 

According to a third aspect of the present invention there is provided an enzyme coded 
by the nucleotide sequence shown as SEQ." I.D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to a fourth aspect of the present invention there is provided a nucleotide 
sequence coding for the enzyme according to the present invention. 

According to a fifth aspect of the present invention there is provided a nucleotide 
sequence having the sequence shown as SEQ. I.D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to a sixth aspect of the present invention there is provided a promoter having 
the sequence shown as SEQ. I.D. No. 3 or a variant, homologue or fragment thereof or 
a sequence complementary thereto. 
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According to a seventh aspect of the present invention there is provided a terminator 
having the nucleotide sequence shown as SEQ. I.D. No. 13 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to an eighth aspect of the present invention there is provided a signal sequence 
having the nucleotide sequence shown as SEQ. I.D. No. 14 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to a ninth aspect of the present invention there is provided a process for 
expressing a GOI by use of a promoter, wherein the promoter is the promoter according 
to the present invention. 

According to a tenth aspect of the present invention there is provided the use of an 
enzyme according to the present invention to degrade an arabinoxylan. 

According to an eleventh aspect of the present invention there is provided a combination 
of enzymes to degrade an arabinoxylan, the combination comprising an enzyme according 
to the present invention and a xylanase. 

According to a twelfth aspect of the present invention there is provided plasmid NCIMB 
40703. or a nucleotide sequence obtainable therefrom for expressing an enzyme capable 
of degrading arabinoxylan or for controlling the expression thereof or for controlling the 
expression of another GOI. 

According to a thirteenth aspect of the present invention there is provided a signal 
sequence having the sequence shown as SEQ. I.D. No. 15 or a variant, homologue or 
fragment thereof. 

According to a fourteenth aspect of the present invention there is provided the use of the 
enzyme according to the present invention in the manufacture of a medicament or 
foodstuff to reduce or prevent indigestion and/or increase digestibility and/or increase 
nutrient absorption. 
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According to a fifteenth aspect of the present invention there is provided an 
arabinofuranosidase enzyme having arabinoxylan degrading activity, which is 
immunologically reactive with an antibody raised against a purified arabinofuranosidase 
enzyme having the sequence shown as SEQ. LD. No. 1. 

5 

According to a sixteenth aspect of the present invention there is provided an 
arabinofuranosidase promoter wherein the promoter is inducible by an intermediate in 
xylose metabolism. 

10 According to a seventeenth aspect of the present invention there is provided a process of 
reducing the viscosity of a branched substrate wherein the enzyme degrades the branches 
of the substrate but not the backbone of the substrate. 

According to a further aspect of the present invention there is provided the use of the 
15 enzyme of the present invention as a viscosity modifier. 

According to a further aspect of the present invention there is provided the use of the 
enzyme of the present invention to reduce the viscosity of pectin. 

20 Other aspects of the present invention include constructs, vectors, plasmids, cells, tissues, 
organs and transgenic organisms comprising the aforementioned aspects of the present 
invention. 

Other aspects of the present invention include methods of expressing or allowing 
25 expression or transforming any one of the nucleotide sequence, the construct, the 
plasmid, the vector, the cell, the tissue, the organ or the organism, as well as the 
products thereof. 



30 



Additional aspects of the present invention include uses of the promoter for expressing 
GOIs in culture media such as a broth or in a transgenic organism. 
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Further aspects of the present invention include uses of the enzyme for preparing or 
treating foodstuffs, including animal feed. 



Preferably the enzyme is coded by the nucleotide sequence shown as SEQ. I.D. No. 2 
or a variant, homologue or fragment thereof or a sequence complementary thereto. 

Preferably the nucleotide sequence has the sequence shown as SEQ. I.D. No. 2 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 

Preferably the nucleotide sequence is operatively linked to a promoter. 

Preferably the promoter comprises the sequence CCAAT. 

Preferably the promoter is the promoter having the sequence shown as SEQ. I.D. No. 
3 or a variant, homologue or fragment thereof or a sequence complementary thereto. 

Preferably, the promoter comprises the 100 bps sequence from the Xma 111 to the 
BamHl sites. 

Preferably the promoter of the present invention is operatively linked to a GOI. 
Preferably the GOI comprises a nucleotide sequence according to the present invention. 
Preferably the transgenic organism is a fungus. 

Preferably the transgenic organism is a filamentous fungus, more preferably of the genus 
Aspergillus. 

Preferably the transgenic organism is a plant. 

Preferably, in the use, the enzyme is used in combination with a xylanase, preferably an 
endoxylanase. 
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Highly preferred embodiments of each of the aspects of the present invention do not 
include any one of the native enzyme, the native promoter or the native nucleotide 
sequence in its natural environment. 

Preferably, in any one of the piasmid, the vector such as an expression vector or a 
transformation vector, the cell, the tissue, the organ, the organism or the transgenic 
organism, the promoter is present in combination with at least one GOI. 

Preferably the promoter and the GOI are stably incorporated within the transgenic 
organism's genome. 

Preferably the transgenic organism is a filamentous fungus, preferably of the genus 
Aspergillus, more preferably Aspergillus niger. The transgenic organism can even be a 
plant, such as a monocot or dicot plant. 

A highly preferred embodiment is an enzyme obtainable from Aspergillus, wherein the 
enzyme has the following characteristics: a MW of 33.270 D ± 50 D: a pi value of 
about 3.7; arabinoxylan degrading activity; a pH optima of from about 2.5 to about 7.0 
(more especially from about 3.3 to about 4.6, more especially about 4); a temperature 
optima of from about 40°C to about 60°C (more especially from about 45°C to about 
55°C, more especially about 50°C); and wherein the enzyme is capable of cleaving 
arabinose from the xylose backbone of an arabinoxylan; wherein the enzyme has the 
sequence shown as SEQ. I. D. No. 1 or a variant, homologue or fragment thereof. 

Another highly preferred embodiment is an enzyme obtainable from Aspergillus, wherein 
the enzyme has the following characteristics: a MW of 33,270 D ± 50 D; a pi value of 
about 3.7; arabinoxylan degrading activity; a pH optima of from about 2.5 to about 7.0 
(more especially from about 3.3 to about 4.6, more especially about 4): a temperature 
optima of from about 40°C to about 60°C (more especially from about 45°C to about 
55°C, more especially about 50°C); and wherein the enzyme is capable of cleaving 
arabinose from the xylose backbone of an arabinoxylan; wherein the enzyme is coded by 
the nucleotide sequence shown as SEQ. I. D. No. 2 or a variant, homologue or fragment 
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thereof or a sequence complementary thereto. 
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The advantages of the present invention are that it provides a means for preparing an 
arabinofuranosidase enzyme having arabinoxylan degrading activity and the nucleotide 
sequence coding for the same. In addition, it provides a promoter that can control the 
expression of that, or another, nucleotide sequence. 

Other advantages are that the enzyme of the present invention can affect the viscosity of 
ground cereals, such as dough, to ease the handling thereof and for example to get a 
higher volume of the bread. 

The enzyme of the present invention is also advantageous for feed because it degrades 
arabinoxylan and thus increases the nutritive value of the feed. In addition, it reduces 
the viscosity of the arabinoxylan in the intestine of the animals and so reduces or prevents 
indigestion. 

The combination of the use of the enzyme of the present invention with a xylanase is 
particularly advantageous because the enzyme of the present invention and the xylanase 
have a surprising and unexpected synergistic effect with each other. 

In this regard, the enzyme of the present invention increases the degradative effect of the 
xylanase, and the xylanase increases the degradative effect of the enzyme of the present 
invention. It is believed that the activity of the xylanase is increased because the enzyme 
of the present invention provides a polysaccharide substrate having fewer substituted 
groups. 

The present invention therefore provides an enzyme . having arabinoxylan degrading 
activity wherein the enzyme can be prepared in certain or specific cells or tissues, such 
as in just a specific cell or tissue, of an organism, typically a filamentous fungus, 
preferably of the genus Aspergillus, such as Aspergillus niger. The enzyme may even be 
prepared in a plant. 
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More in particular, the enzyme of the present invention is capable of specifically cleaving 
arabinose from the xylose backbone of arabinoxylan. 

The arabinofuranosidase of the present invention is different from the 
5 arabinofuranosidases previously known. In this regard, the previous described 
arabinofuranosidases - such as those of EP-A-0506190 - are characterised by their ability 
to degrade unbranched arabinan. and are assayed using p-nitrophenyl-arabinoside. 

The arabinofuranosidase of the present invention does not degrade unbranched arabinan. 
10 and only a minor activity is seen on nitrophenyl-arabinoside. In contrast, the 
arabinofuranosidase of the present invention is useful for degrading arabinoxylan. 
Therefore, the arabinofuranosidase of the present invention is quite different from the 
previous isolated arabinofuranosidases. 



15 Also, the present invention provides a GOI coding for the enzyme that can be expressed 
preferably in specific cells or tissues, such as in certain or specific cells or tissues, of an 
organism, typically a filamentous fungus, preferably of the genus Aspergillus , such as 
Aspergillus niger. The GOI may even be expressed in a plant. 

20 In addition, the present invention provides a promoter that is capable of directing 
expression of a GOI, such as a nucleotide sequence coding for the enzyme according to 
the present invention, preferably in certain specific cells or tissues, such as in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, such as Aspergillus niger, or even a plant. Preferably, the promoter 

25 is used in Aspergillus wherein the product encoded by the GOI is excreted from the host 
organism into the surrounding medium. The promoter may even be tailored (if 
necessary) to express a GOI in a plant. 

The present invention also provides constructs, vectors, plasmids, cells, tissues, organs 
30 and organisms comprising the GOI and/or the promoter, and methods of expressing the 
same, preferably in specific cells or tissues, such as expression in just a specific cell or 
tissue, of an organism, typically a filamentous fungus, preferably of the genus 
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The terms "variant", "homologue" or "fragment" in relation to the enzyme include any 
substitution of, variation of, modification of, replacement of, deletion of or addition of 
one (or more) amino acid from or to the sequence providing the resultant amino acid 
sequence has arabinoxylan degrading activity, preferably having at least the same activity 
of the enzyme shown in the sequence listings (SEQ I.D. No. 1 or 12). In particular, the 
term "homologue" covers homology with respect to structure and/or function providing 
the resultant enzyme has arabinoxylan degrading activity. With respect to sequence 
homology, preferably there is at least 75%, more preferably at least 85%, more 
preferably at least 90% homology to SEQ ID NO. 1 shown in the attached sequence 
listings. More preferably there is at least 95%, more preferably at least 98%, homology 
to SEQ ID NO. 1 shown in the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the nucleotide sequence 
coding for the enzyme include any substitution of, variation of, modification of, 
replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence codes for an enzyme having 
arabinoxylan degrading activity, preferably having at least the same activity of the 
enzyme shown in the sequence listings (SEQ I.D. No. 2 or 12). In particular, the term 
"homologue" covers homology with respect to structure and/or function providing the 
resultant nucleotide sequence codes for an enzyme having arabinoxylan degrading 
activity. With respect to sequence homology, preferably there is at least 75%, more 
preferably at least 85%, more preferably at least 90% homology to SEQ ID NO. 2 shown 
in the attached sequence listings. More preferably there is at least 95%, more preferably 
at least 98%, homology to SEQ ID NO. 2 shown in the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the promoter include any 
substitution of, variation of, modification of, replacement of deletion of or addition of 
one (or more) nucleic acid from or to the sequence providing the resultant nucleotide 
sequence has the ability to act as a promoter in an expression system - such as the 
transformed cell or the transgenic organism according to the present invention. In 
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particular, the term "homologue" covers homology with respect to structure and/or 
function providing the resultant nucleotide sequence has the ability to act as a promoter. 
With respect to sequence homology, preferably there is at least 75%. more preferably at 
least 85%. more preferably at least 90% homology to SEQ ID NO. 3 shown in the 
attached sequence listings. More preferably there is at least 95%, more preferably at 
least 98%. homology to SEQ ID NO. 3 shown in the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the terminator or signal 
nucleotide sequences include any substitution of, variation of, modification of. 
replacement of. deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence has the ability to act as a terminator 
or codes for an amino acid sequence that has the ability to act as a signal sequence 
respectively in an expression system - such as the transformed cell or the transgenic 
organism according to the present invention. In particular, the term "homologue" covers 
homology with respect to structure and/or function providing the resultant nucleotide 
sequence has the ability to act as or code for a terminator or signal respectively. With 
respect to sequence homology, preferably there is at least 75%, more preferably at least 
85%, more preferably at least 90% homology to SEQ ID NO.s 13 and 14 (respectively) 
shown in the attached sequence listings. More preferably there is at least 95%, more 
preferably at least 98%, homology to SEQ ID NO.s 13 and 14 (respectively) shown in 
the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the signal amino acid 
sequence include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) amino acid from or to the sequence providing 
the resultant sequence has the ability to act as a signal sequence in an expression system - 
such as the transformed cell or the transgenic organism according to the present 
invention. In particular, the term "homologue" covers homology with respect to structure 
and/or function providing the resultant nucleotide sequence has the ability to act as or 
code for a signal respectively. With respect to sequence homology, preferably there is 
at least 75%, more preferably at least 85%, more preferably at least 90% homology to 
SEQ ID NO 15 shown in the attached sequence listings. More preferably there is at least 
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957c. more preferably at least 98%, homology to SEQ ID NO 15 shown in the attached 
sequence listings. 

The above terms are synonymous with allelic variations of the sequences. 

The term "complementary" means that the present invention also covers nucleotide 
sequences that can hybridise to the nucleotide sequences of the coding sequence or the 
promoter sequence, respectively. 

The term "nucleotide" in relation to the present invention includes genomic DNA, cDNA, 
synthetic DNA. and RNA. Preferably it means DNA, more preferably cDNA for the 
coding sequence of the present invention. 

The term "construct" - which is synonymous with terms such as "conjugate", "cassette" 
and "hybrid" - includes a GOI directly or indirectly attached to a promoter. An example 
of an indirect attachment is the provision of a suitable spacer group such as an intron 
sequence, such as the Shl-\nxion or the ADH intron, intermediate the promoter and the 
GOI. The same is true for the term "fused" in relation to the present invention which 
includes direct or indirect attachment. In each case, it is highly preferred that the terms 
do not cover the natural combination of the gene coding for the enzyme ordinarily 
associated with the wild type gene promoter and when they are both in their natural 
environment. A highly preferred embodiment is the or a GOI being operatively linked 
to a or the promoter. 

The construct may even contain or express a marker which allows for the selection of the 
genetic construct in, for example, a filamentous fungus, preferably of the genus 
Aspergillus, such as Aspergillus niger. or plants, preferably cereals, such as maize, rice, 
barley etc., into which it has been transferred. Various markers exist which may be 
used, such as for example those encoding mannose-6-phosphate isomerase (especially for 
plants) or those markers that provide for antibiotic resistance - e.g. resistance to G418, 
hygromycin, bleomycin, kanamycin and gentamycin. 
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The term "vector" includes expression vectors and transformation vectors. 

The term "expression vector" means a construct capable of in vivo or in vitro expression. 

The term "transformation vector" means a construct capable of being transferred from 
one species to another - such as from an E.coli plasmid to a filamentous fungus, 
preferably of the genus Aspergillus. It may even be a construct capable of being 
transferred from an E.coli plasmid to an Agrobacterium to a plant. 

The term "tissue" includes tissue per se and organ. 

The term "organism" in relation to the present invention includes any organism that could 
comprise the promoter according to the present invention and/or the nucleotide sequence 
coding for the enzyme according to the present invention and/or products obtained 
therefrom, wherein the promoter can allow expression of a GOI and/or wherein the 
nucleotide sequence according to the present invention can be expressed when present in 
the organism. 

Preferably the organism is a filamentous fungus, preferably of the genus Aspergillus. 
more preferably Aspergillus niger. 

The term "transgenic organism" in relation to the present invention includes any organism 
that comprises the promoter according to the present invention and/or the nucleotide 
sequence coding for the enzyme according to the present invention and/or products 
obtained therefrom, wherein the promoter can allow expression of a GOI and/or wherein 
the nucleotide sequence according to the present invention can be expressed within the 
organism. Preferably the promoter and/or the nucleotide sequence is (are) incorporated 
in the genome of the organism. 

Preferably the transgenic organism is a filamentous fungus, preferably of the genus 
Aspergillus, more preferably Aspergillus niger. 
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Therefore, the transgenic organism of the present invention includes an organism 
comprising any one of. or combinations of, the promoter according to the present 
invention, the nucleotide sequence coding for the enzyme according to the present 
invention, constructs according to the present invention, vectors according to the present 
5 invention, plasmids according to the present invention, cells according to the present 
invention, tissues according to the present invention or the products thereof. For example 
the transgenic organism can comprise a GOI, preferably an exogenous nucleotide 
sequence, under the control of the promoter according to the present invention. The 
transgenic organism can also comprise the nucleotide sequence coding for the enzyme of 
10 the present invention under the control of a promoter, which may be the promoter 
according to the present invention. 



In a highly preferred embodiment, the transgenic organism does not comprise the 
combination of the promoter according to the present invention and the nucleotide 

15 sequence coding for the enzyme according to the present invention, wherein both the 
promoter and the nucleotide sequence are native to that organism and are in their natural 
environment. Thus, in these highly preferred embodiments, the present invention does 
not cover the native nucleotide coding sequence according to the present invention in its 
natural environment when it is under the control of its native promoter which is also in 

20 its natural environment. In addition, in this highly preferred embodiment, the present 
invention does not cover the native enzyme according to the present invention when it is 
in its natural environment and when it has been expressed by its native nucleotide coding 
sequence which is also in its natural environment and when that nucleotide sequence is 
under the control of its native promoter which is also in its narural environment. 

25 

The term "promoter" is used in the normal sense of the an, e.g. an RNA polymerase 
binding site in the Jacob-Mond theory of gene expression. 



In one aspect, the promoter of the present invention is capable of expressing a GOI, 
30 which can be the nucleotide sequence coding for the enzyme of the present invention. 
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In another aspect, the nucleotide sequence according to the present invention is under the 
control of a promoter that allows expression of the nucleotide sequence. In this regard, 
the promoter need not necessarily be the same promoter as that of the present invention. 
In this aspect, the promoter may be a cell or tissue specific promoter. If, for example, 
the organism is a plant then the promoter can be one that affects expression of the 
nucleotide sequence in any one or more of stem, sprout, root and leaf tissues. 

By way of example, the promoter for the nucleotide sequence of the present invention can 
be the a-Amy 1 promoter (otherwise known as the Amy 1 promoter, the Amy 637 
promoter or the a-Amy 637 promoter) as described in our co-pending UK patent 
application No. 9421292.5 filed 21 October 1994. That promoter comprises the sequence 
shown in Figure 1 . 

Alternatively, the promoter for the nucleotide sequence of the present invention can be 
the a-Amy 3 promoter (otherwise known as the Amy 3 promoter, the Amy 351 promoter 
or the a-Amy 351 promoter) as described in our co-pending UK patent application No. 
9421286.7 filed 21 October 1994. That promoter comprises the sequence shown in 
Fieure 2. 



Preferably, the promoter is the promoter of the present invention. 

In addition to the nucleotide sequences described above, the promoters, particularly that 
of the present invention, could additionally include features to ensure or to increase 
expression in a suitable host. For example, the features can be conserved regions such 
as a Pribnow Box or a TATA box. The promoters may even contain other sequences to 
affect (such as to maintain, enhance, decrease) the levels of expression of the GOI. For 
example, suitable other sequences include the S/j/-intron or an ADH intron. Other 
sequences include inducible elements - such as temperature, chemical, light or stress 
inducible elements. 

Also, suitable elements to enhance transcription or translation may be present. An 
example of the latter element is the TMV 5" signal sequence (see Sleat Gene 217 [1987] 
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217-225; and Dawson Plant Mol. Biol. 23 [1993] 97). 
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In addition the present invention also encompasses combinations of promoters and/or 
nucleotide sequences coding for proteins or enzymes and/or elements. For example, the 
present invention encompasses the combination of a promoter according to the present 
invention operatively linked to a GOI, which could be a nucleotide sequence according 
to the present invention, and another promoter such as a tissue specific promoter 
operatively linked to the same or a different GOI. 

The present invention also encompasses the use of promoters to express a nucleotide 
sequence coding for the enzyme according to the present invention, wherein a pan of the 
promoter is inactivated but wherein the promoter can still function as a promoter. Partial 
inactivation of a promoter in some instances is advantageous. 

In particular, with the Amy 351 promoter mentioned earlier it is possible to inactivate a 
part of it so that the partially inactivated promoter expresses GOIs in a more specific 
manner such as in just one specific tissue type or organ. 

The term "inactivated" means partial inactivation in the sense that the expression pattern 
of the promoter is modified but wherein the partially inactivated promoter still functions 
as a promoter. However, as mentioned above, the modified promoter is capable of 
expressing a GOI in at least one (but not all) specific tissue of the original promoter. 
One such promoter is the Amy 351 promoter described above. 

Examples of partial inactivation include altering the folding pattern of the promoter 
sequence, or binding species to parts of the nucleotide sequence, so that a pan of the 
nucleotide sequence is not recognised by, for example. RNA polymerase. Another, and 
preferable, way of panially inactivating the promoter is to truncate it to form fragments 
thereof. Another way would be to mutate at least a pan of the sequence so that the RNA 
polymerase can not bind to that pan or another pan. 
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Another modification is to mutate the binding sites for regulatory proteins for example 
the CreA protein known from filamentous fungi to exert carbon catabolite repression, and 
thus abolish the catabolite repression of the native promoter. 

The term "GO I" with reference to the present invention means any gene of interest. A 
GOI can be any nucleotide that is either foreign or natural to the organism (e.g. 
filamentous fungus, preferably of the genus Aspergillus, or a plant) in question. Typical 
examples of a GOI include genes encoding for proteins and enzymes that modify 
metabolic and catabolic processes. The GOI may code for an agent for introducing or 
increasing pathogen resistance. The GOI may even be an antisense construct for 
modifying the expression of natural transcripts present in the relevant tissues. The GOI 
may even code for a non-natural protein of a filamentous fungus, preferably of the genus 
Aspergillus, or a compound that is of benefit to animals or humans. 

For example, the GOI could code for a pharmaceutical ly active protein or enzyme such 
as any one of the therapeutic compounds insulin, interferon, human serum albumin, 
human growth factor and blood clotting factors. In this regard, the transformed cell or 
organism could prepare acceptable quantities of the desired compound which would be 
easily retrievable from, the cell or organism. The GOI may even be a protein giving 
nutritional value to a food or crop. Typical examples include plant proteins that can 
inhibit the formation of anti-nutritive factors and plant proteins that have a more desirable 
ammo acid composition (e.g. a higher lysine content than a non-transgenic plant). The 
GOI may even code for an enzyme that can be used in food processing such as chymosin. 
thaumatin and or-galactosidase. The GOI can be a gene encoding for any one of a pest 
toxin, an antisense transcript such as that for patatin or a-amylase, ADP-glucose 
pyrophosphorylase (e.g. see EP-A-0455316), a protease antisense or a glucanase. 

The GOI can be the nucleotide sequence coding for the a-amylase enzyme which is the 
subject of our co-pending UK patent application 9413439.2 filed on 4 July 1994, the 
sequence of which is shown in Figure 3. The GOI can be the nucleotide sequence coding 
for the a-amylase enzyme which is the subject of our co-pending UK patent application 
9421290.9 filed on 21 October 1994. the sequence of which is shown in Figure 4. The 
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GOI can be any of the nucleotide sequences coding for the ADP-glucose 
pyrophosphorylase enzymes which are the subject of our co-pending PCT patent 
application PCT/EP94/01082 filed 7 April 1994, the sequences of which are shown in 
Figures 5 and 6. The GOI can be any of the nucleotide sequences coding for the a- 
glucan lyase enzyme which are described in our co-pending PCT patent application 
PCT/EP94/03397 filed 15 October 1994, the sequences of which are shown in Figures 
7-10. 



In one preferred embodiment, the GOI is a nucleotide sequence coding for the enzyme 
according to the present invention. 

As mentioned above, a preferred host organism is of the genus Aspergillus, such as 
Aspergillus niger. The transgenic Aspergillus according to the present invention can be 
prepared by following the teachings of RambosekJ. and Leach,J. 1987 (Recombinant 
DNA in filamentous fungi: Progress and Prospects. CRC Crit. Rev. Biotechnol. 6:357- 
393), Davis R.W. 1994 (Heterologous gene expression and protein secretion in Asperg- 
illus. In: Maninelli S.D., Kinghorn J.R.( Editors) Aspergillus: 50 years on. Progress in 
industrial microbiology vol 29, Elsevier Amsterdam 1994. pp 525-560), Ballance,D.J. 
1991 (Transformation systems for Filamentous Fungi and an Overview of Fungal Gene 
structure. In :Leong,S.A., Berka R.M. (Editors) Molecular Industrial Mycology. Systems 
and Applications for Filamentous Fungi. Marcel Dekker Inc. New York 1991. pp 1-29) 
and Turner G. 1994 (Vectors for genetic manipulation. In: Maninelli S.D.. Kinghorn 
J.R.( Editors) Aspergillus: 50 years on. Progress in industrial microbiology vol 29. 
Elsevier Amsterdam 1994. pp. 641-666). However, the following commentary provides 
a summary of those teachings for producing transgenic Aspergillus according to the 
present invention. 

Filamentous fungi have during almost a century been widely used in industry for produc- 
tion of organic compounds and enzymes. Traditional Japanese koji and soy fermentations 
have used Aspergillus sp for hundreds of years. In this century Aspergillus niger has 
been used for production of organic acids particular citric acid and for production of 
various enzymes for use in industry. 
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There are two major reasons for that filamentous fungi have been so widely used in 
industry. First filamentous fungi can produce high amounts of extracellular products, for 
example enzymes and organic compounds such as antibiotics or organic acids. Second 
filamentous fungi can grow on low cost substrates such as grains, bran, beet pulp etc. 
The same reasons have made filamentous fungi attractive organisms as hosts for 
heterologous expression according to the present invention. 

In order to prepare the transgenic Aspergillus, expression constructs are prepared by 
inserting a GOI (such as an amylase or SEQ. I.D. No. 2) into a construct designed for 
expression in filamentous fungi. 

Several types of constructs used for heterologous expression have been developed. The 
constructs contain the promoter according to the present invention (or if desired another 
promoter if the GOI codes for the enzyme according to the present invention) which is 
active in fungi. Examples of promoters other than that of the present invention include 
a fungal promoter for a highly expressed extracellular enzyme, such as the glucoamylase 
promoter or the a-amylase promoter. The GOI can be fused to a signal sequence (such 
as that of the present invention or another suitable sequence) which directs the protein 
encoded by the GOI to be secreted. Usually a signal sequence of fungal origin is used, 
such as that of the present invention. A terminator active in fungi ends the expression 
system, such as that of the present invention. 

Another type of expression system has been developed in fungi where the GOI is fused 
to a smaller or a larger part of a fungal gene encoding a stable protein. This can stabilize 
the protein encoded by the GOI. In such a system a cleavage site, recognized by a 
specific protease, can be introduced between the fungal protein and the protein encoded 
by the GOI, so the produced fusion protein can be cleaved at this position by the specific 
protease thus liberating the protein encoded by the GOI ("POP). By way of example, 
one can introduce a site which is recognized by a KEX-2 like peptidase found in at least 
some Aspergilli. Such a fusion leads to cleavage in vivo resulting in protection of the 
POI and production of POI and not a larger fusion protein. 
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Heterologous expression in Aspergillus has been reported for several genes coding for 
bacterial, fungal, vertebrate and plant proteins. The proteins can be deposited 
intracellular^ if the GOI is not fused to a signal sequence. Such proteins will accumulate 
in the cytoplasm and will usually not be glycosylated which can be an advantage for some 
bacterial proteins. If the GOI is equipped with a signal sequence the protein will 
accumulate extracellular)'. 

With regard to product stability and host strain modifications, some heterologous proteins 
are not very stable when they are secreted into the culture fluid of fungi. Most fungi 
produce several extracellular proteases which degrade heterologous proteins. To avoid 
this problem special fungal strains with reduced protease production have been used as 
host for heterologous production. 

For the transformation of filamentous fungi, several transformation protocols have been 
developed for many filamentous fungi (Ballance 1991, ibid). Many of them are based 
on preparation of protoplasts and introduction of DN A into the protoplasts using PEG and 
Ca 2 + ions. The transformed protoplasts then regenerate and the transformed fungi are 
selected using various selective markers. Among the markers used for transformation are 
a number of auxotrophic markers such as argB, trpC, niaD and pyrG, antibiotic 
resistance markers such as benomyl resistance, hygromycin resistance and phleomycin 
resistance. A very common used transformation marker is the amdS gene of A. nidulans 
which in high copy number allows the fungus to grow with acrylamide as the sole 
nitrogen source. 

Even though the enzyme, the nucleotide sequence coding for same and the promoter of 
the present invention are not disclosed in EP-B-0470145 and CA-A-2006454, those two 
documents do provide some useful background commentary on the types of techniques 
that may be employed to prepare transgenic plants according to the present invention. 
Some of these background teachings are now included in the following commentary. 

The basic principle in the construction of genetically modified plants is to insert genetic 
information in the plant genome so as to obtain a stable maintenance of the inserted 
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genetic material. 



Several techniques exist for inserting the genetic information, the two main principles 
being direct introduction of the genetic information and introduction of the genetic 
information by -use of a vector system. A review of the general techniques may be found 
in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205-225) and 
Christou (Agro-Food-Industry Hi-Tech March/ April 1994 17-27). 

Thus, in one aspect, the present invention relates to a vector system which carries a 
promoter or nucleotide sequence or construct according to the present invention and 
which is capable of introducing the promoter or nucleotide sequence or construct into the 
genome of an organism, such as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In the case 
of two vectors, the vector system is normally referred to as a binary vector system. 
Binary vector systems are described in further detail in Gynheung An et al. (1980), 
Binary Vectors, Plant Molecular Biology Manual A3, 1-19. 

One extensively employed system for transformation of plant cells with a given promoter 
or nucleotide sequence or construct is based on the use of a Ti plasmid from 
Agrobacterium tumefaciens or a Ri plasmid from Agrobacterium rhizogenes An et al. 
(1986), Plant PhysioL 81, 301-305 and Butcher D.N. et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson, 203-208. 

Several different Ti and Ri plasmids have been constructed which are suitable for the 
construction of the plant or plant cell constructs described above. A non-limiting 
example of such a Ti plasmid is pGV3850. 

The promoter or nucleotide sequence or construct of the present invention should 
preferably be inserted into the Ti-plasmid between the terminal sequences of the T-DNA 
or adjacent a T-DNA sequence so as to avoid disruption of the sequences immediately 
surrounding the T-DNA borders, as at least one of these regions appear to be essential 
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for insertion of modified T-DNA into the plant genome. 
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As will be understood from the above explanation, if the organism is a plant, then the 
vector system of the present invention is preferably one which contains the sequences 
necessary to infect the plant (e.g. the vir region) and at least one border pan of a T-DNA 
sequence, the border pan being located on the same vector as the genetic construct. 

Furthermore, the vector system is preferably an Agrobacterium tumefaciens Ti-piasmid 
or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof, as these plasmids are 
well-known and widely employed in the construction of transgenic plants, many vector 
systems exist which are based on these plasmids or derivatives thereof. 

In the construction of a transgenic plant the promoter or nucleotide sequence or construct 
of the present invention may be first constructed in a microorganism in which the vector 
can replicate and which is easy to manipulate before insertion into the plant. An example 
of a useful microorganism is E. coli, but other microorganisms having the above 
properties may be used. When a vector of a vector system as defined above has been 
constructed in E. colt, it is transferred, if necessary, into a suitable Agrobacterium strain, 
e.g. Agrobacterium tumefaciens. The Ti-plasmid harbouring the promoter or nucleotide 
sequence or construct of the invention is thus preferably transferred into a suitable 
Agrobacterium strain, e.g. A. tumefaciens, so as to obtain an Agrobacterium cell 
harbouring the promoter or nucleotide sequence or construct of the invention, which 
DNA is subsequently transferred into the plant ceil to be modified. 

As reported in CA-A-2006454, a large amount of cloning vectors are available which 
contain a replication system in E. coli and a marker which allows a selection of the 
transformed cells. The vectors contain for example p,BR 322, pUC series, Ml 3 mp 
series, pACYC 184 etc. 

In this way, the nucleotide or construct or promoter of the present invention can be 
introduced into a suitable restriction position in the vector. The contained plasmid is 
used for the transformation in E.coli. The E.coli cells are cultivated in a suitable nutrient 
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medium and then harvested and lysed. The plasmid is then recovered. As a method of 
analysis there is generally used sequence analysis, restriction analysis, electrophoresis and 
further biochemical-molecular biological methods. After each manipulation, the used 
DNA sequence can be restricted and connected with the next DNA sequence. Each 
sequence can be cloned in the same or different plasmid. 

After each introduction method of the desired promoter or construct or nucleotide 
sequence according to the present invention in the plants the presence and/or insertion of 
further DNA sequences may be necessary. If, for example, for the transformation the 
Ti- or Ri-plasmid of the plant cells is used, at least the right boundary and often however 
the right and the left boundary of the Ti- and Ri-plasmid T-DNA, as flanking areas of 
the introduced genes, can be connected. The use of T-DNA for the transformation of 
plant cells has been intensively studied and is described in EP-A-120516; Hoekema, in: 
The Binary Plant Vector System Offset-dmkkerij Kanters B.B., Alblasserdam, 1985, 
Chapter V; Fraley, et al., Crit. Rev. Plant Sci., 4:1-46; and An et al., EMBO J. (1985) 
4:277-284. 

Direct infection of plant tissues by Agrobacterium is a simple technique which has been 
widely employed and which is described in Butcher D.N. et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson, 203-208. For 
further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant Mol Biol 
[1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/ April 1994 17-27). 
With this technique, infection of a plant may be done on a certain pan or tissue of the 
plant, i.e. on a part of a leaf, a root, a stem or another pan of the plant. 

Typically, with direct infection of plant tissues by Agrobacterium carrying the promoter 
and/or the GOI, a plant to be infected is wounded, e.g. by cutting the plant with a razor 
or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound 
is then inoculated with the Agrobacterium. The inoculated plant or plant pan is then 
grown on a suitable culture medium and allowed to develop into mature plants. 
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When plant cells are constructed, these cells may be grown and maintained in accordance 
with well-known tissue culturing methods such as by culturing the cells in a suitable 
culture medium supplied with the necessary growth factors such as amino acids, plant 
hormones, vitamins, etc. 

Regeneration of the transformed cells into genetically modified plants may be 
accomplished using known methods for the regeneration of plants from cell or tissue 
cultures, for example by selecting transformed shoots using an antibiotic and by 
subculruring the shoots on a medium containing the appropriate nutrients, plant 
hormones, etc. 

Further teachings on plant transformation may be found in EP-A-0449375. 

In summation, the present invention provides an arabinofuranosidase enzyme having 
arabinoxylan degrading activity and the nucleotide sequence coding for the same. In 
addition, it provides a promoter that can control the expression of that, or another, 
nucleotide sequence. In addition it includes terminator and signal sequences for the 
same. 

The following sample was deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria Limited 
(NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom. AB2 1RY on 
16 January 1995: 



E.coli containing plasmid pB53.1 {i.e. E.coli DH5a- 
pB53.1}. The deposit number is NCIMB 40703. 



The present invention will now be described by way of example. 



30 



In the following Examples reference is made to the accompanying figures in which: 



WO 96/29416 PCI7EP96/01009 

28 

Figures 1-10 are sequences of promoters and GOIs of earlier patent applications that are 
useful for use with the aspects of the present invention; 

Figure 1 1 is a plasmid map of the plasmid pB53.1, which is the subject of deposit 
5 NCIMB 40703; 

Figure 12 is a schematic diagram of deletions made to the promoter of the present 
invention; 

10 Figure 13 is a plasmid map of pXP-AMY; 

Figure 14 is a plasmid map of pXP-XssAMY; 
Figure 15 is a graph; 

15 

Figure 16 is an HP-TLC profile; 
Figure 17 is an HP-TLC profile; 
20 Figure 18 is an HPLC profile; 
Figure 19 is a viscosity plot; 
Figure 20 is an activity plot; 

25 

Figure 21 is an activity plot; and 

Figure 22 is an activity plot. 

30 The following text discusses the use of inter alia recombinant DNA techniques. General 
teachings of recombinant DNA techniques may be found in Sambrook, J., Fritsch, E.F., 
Maniatis T. (Editors) Molecular Cloning. A laboratory manual. Second edition. Cold 
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Spring Harbour Laboratory* Press. New York 1989. 



In these Examples, the enzyme of the present invention is sometimes referred to as AbfC. 
In addition, the promoter of the present invention is sometimes referred to as the AbfC 
promoter. 

Purification of the arabinofuranosidase 

Aspergillus niger 3M43 was grown in medium containing wheat bran and beet pulp. The 
fermentation broth was separated from the solid pan of the broth by filtration. 
Concentrated fermentation broth was loaded on a 25X 100mm Q-SEPH AROSE 
(Pharmacia) high Performance column, equilibrated with 20 mM Tris, HC1 pH 7.5, and 
a linear gradient from 0-500 Mm NaCl was performed and fractions of the eluate was 
collected. The Arabinofuranosidase was eluted at 130-150 Mm NaCl. 

The fractions containing the arabinofuranosidase were combined and desalted using a 
50x200 mm G-25 SEPHAROSE Superfine (Pharmacia). The column was eluted with 
distilled water. 

After desalting the enzyme was concentrated using High-Trap spin columns. Next the 
concentrated and desalted fractions were subjected to gel filtration on a 50x600 mm 
SUPERDEX 50 column. The sample was loaded and the column was eluted with 0.2 M 
Phosphate buffer pH 7.0 plus 0.2 M NaCl, and fractions of the eluate were collected. 

The fractions containing arabinofuranosidase were combined and desalted and 
concentrated as described above. The combined fractions were loaded on a 16X100 mm 
Phenylsepharose High Performance column (Pharmacia), equilibrated with 50 mM 
Phosphate buffer pH 6.0, containing 1.5 M (NH 4 ) : S0 4 . A gradient where the (NH 4 ) : SO, 
concentration was varied from 1 .5 - 0 M was applied and the eluate collected in fractions. 
The fractions containing Arabinofuranosidase were combined. The purity of the 
arabinofuranosidase was evaluated by SDS-PAGE using the Phast system gel 
(Pharmacia). 
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Characterization 

The molecular weight of the purified arabinofuranosidase was determined by mass 
spectrometry using laser desorption technology. The MW of the arabinofuranosidase was 
found to be 33.270 D ± 50 D. 

The pi value was determined by use of a Broad pi Kit (Pharmacia). The 
arabinofuranosidase has a pi value of about 3.7. 

After SDS-PAGE analysis, treatment PAS reagent showed that the arabinofuranosidase 
was glycosylated. The PAS staining was done according to the procedure of I. Van- 
Seuningen and M. Davril (1992) Electrophoresis 13 pp 97-99. 
Activity Studies 

Activity of AbfC as a function of water soluble pentosan (WSP) concentrations (mg/ml) 
was determined. The results are shown in Figure 21. The results show that AbfC 
activity reached maximum at substrate concentration of 8 mg/ml WSP. 

pH Activity Studies 

The effect of pH on the activity of the arabinofuranosidase of the present invention was 
investigated using water soluble pentosan (10 mg/ml) from wheat as a substrate in 50 mM 
citric acid sodium phosphate buffer. The incubation time was 15 minutes. The 
arabinofuranosidase of the present invention was observed to have a wide pH optima 
range of from about 2.5 to about 7.0 (see Figure 20). more especially from about 3.3 to 
about 4.6, more especially about 4. 

Temperature Activity Studies 

The effect of temperature on the activity of the arabinofuranosidase of the present 
invention was investigated using water soluble pentosan (10 mg/ml) from wheat as a 
substrate in 50 mM sodium acetate at a pH of 5.0. The incubation time was 15 minutes. 
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The arabinofuranosidase of the present invention was observed to have an optimal activity 
at a temperature of from about 40°C to about 60°C, more especially from about 45°C to 
about 55°C, more especially about 50°C (Figure 22). The enzyme is still active at about 
10°C and showed residual activity at 70°C and 80°C. 

Amino acid sequencing of the arabinofuranosidase 

The enzyme was digested with endoproteinase Lys-C sequencing grade from Boehringer 
Mannheim using a modification of the method described by Stone & Williams 1993 
(Stone, K.L. and Williams, K.R. (1993). Enzymatic digestion of Proteins and HPLC 
Peptide Isolation. In : Matsudaira P. (Editor). A practical Guide to Protein and Peptide 
Purification for Microsequencing. Second Edition. Academic Press, San Diego 1993. pp 
45-73). 



15 



20 



25 



Freeze dried 0-arabinofuranosidase (0.4 mg) was dissolved in 50 M l of 8M urea, 0.4 M 
NH 4 HC0 3 , pH 8.4. After overlay with N 2 and addition of 5 yA of 45 Mm DTT, the 
protein was denatured and reduced for 15 min at 50°C under N : . After cooling to RT, 
5 ii\ of 100 Mm iodoacetamide was added for the cysteines to be derivatised for 15 min 
at RT in the dark under N 2 . Subsequently, 90 M l of water and 5 M g of endoproteinase 
Lys-C m 50 M l of 50 Mm Tricine and 10 mM EDTA, pH 8.0, was added and the 
digestion was carried out for 24h at 37°C under N = . The resulting peptides were 
separated by reversed phase HPLC on a VYDAC C18 column (0.46 x 15 cm; 10 M m; 
The Separations Group; California) using solvent A: 0.1% TFA in water and solvent B: 
0. 1 % TFA in acetonitrile. Selected peptides were rechromatographed on a Develosil CI 8 
column (0.46 x 10 cm; 3 M m) using the same solvent system prior to sequencing on an 
Applied Biosystems 476A sequencer using pulsed-liquid fast cycles. 

The following peptide sequences were found: 
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SEQ I.D No. 4 
SEQ I.D. No. 5 
SEQ I.D. No. 6 
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SEQ I.D. No. 7 
SEQ I.D. No. 8 

Isolation of a PCR clone of a fragment of the gene 

PCR primers were synthesised using an Applied Biosystems DNA synthesiser model 392. 
In this regard, PCR primers were synthesized from one of the found peptide sequences, 
namely SEQ ID No. 5. The primers were: 



10 One primer from EMTAQA (reversed) 

SEQ ID NO. 9 GCY TGN GCN GTC ATY TC 

17 mer 64 mix 

15 One primer from MIVEAIG 

SEQ ID NO. 10 ATG ATH GTN GAR GCN ATH GG 

20 mer 288 mix 

20 PCR amplification was performed with 100 pmol of each of these primers in 100 fi\ 
reactions using Amplitaq polymerase (PERKIN ELMER). The following program was: 



TIME 



25 



30 



STEP 


TEMP 




1 


94°C 


2 min 


2 


94°C 


1 min 


3 


55°C 


2 min 


4 


72°C 


2 min 


5 


72°C 


5 min 


6 


5°C 


SOAK 



Steps 2-4 were repeated for 40 cycles. 
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PCR reactions were performed on a PERKIN ELMER DNA Thermal Cycler. 



A 100 bp amplified fragment was isolated and cloned into a pT7-Blue T-vector, according 
to the manufacturers instructions (Novagen). 

Isolation of A. niger genomic DNA 

Ig. of frozen A. niger mycelium was ground in a mortar under liquid nitrogen. Follow- 
ing evaporation of the nitrogen cover, the ground mycelium was extracted with 15ml of 
an extraction buffer (lOOmM Tris-Hcl, pH 8.0, O.50mM EDTA, 500mM NaCL lOmM 
ft-mercaptoethanol) containing 1ml 20% sodium dodecyl sulphate. After incubation at 
65°C for 10 min. 5ml 5M KAc. pH 5.0. was added and the mixture further incubated, 
after mixing, on ice for 20 mins. After extraction, the mixrure was centrifuged for 20 
mins. and the supernatant mixed with 0.6 vol. isopropanol to precipitate the extracted 
DNA. After further centrifugation for 15 mins. the DNA pellet was dissolved in 0.7 ml 
TE (lOmM Tris, HC1 pH 8.0, ImM EDTA) and precipitated with 75 ^1 3M NaAc, pH 
4.8, and 500 ^1 isopropanol. 

After centrifugation the pellet was washed with 70% ETOH and dried under vacuum. 
The DNA was dissolved in 200 jxl TE and stored at -20°C. 

Construction of a library 

20 ^g genomic DNA was partly digested with Tsp509I, which gives ends which are 
compatible with EcoRl ends. The digested DNA was separated on a 1 % agarose gel and 
fragments of 4-10 kb was purified. A XZAPII EcoRll CI AP kit from Stratagene was used 
for library construction according to the manufacturers, instructions. 2 ^1 of the ligation 
(totally 5 /zl) was packed with Gigapack Gold II packing extract from Stratagene. The 
library contained 650,000 independent clones. 
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Screening of the library 

2 X 50.000 pfu was plated on NZY plates and plaquelifts were done on Hybond N sheets 
(Amersham). Plaquelifts were done in duplicates. The sheets were hybridized with the 
5 PCR clone labelled with 32 P dCTP (Amersham) using Ready-to-go labelling kit from 
Pharmacia. Positive clones were reckoned only when hybridization was detected on both 
sheets. The gene was sequenced, and the found sequence showed that all of the peptides 
sequenced were coded by the found sequence. 

10 Sequence information 

SEQ. ID. No. 12 presents the promoter sequence, the enzyme coding sequence, the 
terminator sequence and the signal sequence and the amino acid sequence of the enzyme 
of the present invention. 

15 

Arabinofuranosidase assay 

Two different arabinoxylan preparations from wheat flour, Wheat Insoluble Pentosan 
(WIP) and Wheat soluble Pentosan (WSP), were degraded with the arabinofuranosidase 

20 enzyme of the present invention alone and in combination with an endoxylanase purified 
from A. niger. The assays were done on 1% substrate in 50 Mm 50 Mm Na-acetate 
buffer at pH 5.0. The reactions were performed at 30 °C for 2.5 hours. The reactions 
were stopped by addition of 3 vol. ethanol which precipitates the high molecular weight 
material. The samples were centrifuged and the supernatants were collected, dried under 

25 vacuum and resuspended in 0.5 ml distilled water. The samples were diluted 1:1 in 
water and analysed on a Chromopack Carbohydrate Pb column (300X7.8 mm, cat. 
29010) using Shimadzu C-R4A Chromatopac HPLC system using a Shimadzu RI D-6A 
refractive index detector in accordance with the suppliers instructions. 

30 The column was calibrated using a standard composed of 0.48 mg/ml xylotriose, 0.48 
mg/ml xylobiose, 0.60 mg/ml xylose and 0.58 mg/ml L-arabinose. The peaks were 
identified and quantified using the software supplied with the equipment. 
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Results - Liberated saccharides from Wheat Insoluble Pentosan 

Substrate 1 % WIP in 50 Mm Na-acetate buffer pH 5.0. Values are expressed in mg/ml 





xylotriose 


xylobiose 


xylose 


arabinose 


no enzyme 


0.0 


0.0 


0.0 


0.0 


abfC 


0.0 


0.0 


0.0 


0.11 


xyl 


0.09 


0.14 


0.0 


0.0 


abfC + xyl 


0.37 


0.41 


0.0 


0.30 



abfC denotes the enzyme according to the present invention; and xyl denotes the xylanase 
described before. 

Results - Saccharides liberated from Wheat Soluble Pentosan 



Substrate 1 % WSP in 50 Mm Na-acetate buffer pH 5.0. Values are expressed 
mg/ml. 



in 





xylotriose 


xylobiose 


xylose 


arabinose 


no enzyme 


0.0 


0.0 


0.0 


0.0 


abfC 


0.0 


0.0 


0.0 


0.30 


xyl 


0.08 


0.14 


0.0 


' 0.0 


abfC + xyl 


0.42 


0.47 


0.0 


0.42 



abfC denotes the enzyme according to the present invention; and xyl denotes the xylanase 
described before. 
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Figure 17 shows HP-TLC profiles of the AbfC enzyme acting synergistically with 
Xylanase A. In this Figure, the following abbreviations are used: water-soluble pentosan 
(WSP): water- insoluble pentosan (WIP); and oat xylan as substrate. The standards were: 
X- xylose: XV xylobiose: X,- xyiotriose; A- arabinose. 

Figure 18 shows the HPLC analysis of hydrolysis products using 1% oat spelt xylan as 
the substrate. Figure 18(a) and Figure 18(b) show the products when the AbfC enzyme 
and the xylanase enzyme respectively were used alone. Figure 18(c) show the products 
when the AbfC enzyme and the xylanase enzyme when combined. 

The results of these experiments provide two important findings. 

First the enzyme of the present invention liberates arabinose, in particular L-arabinose, 
from arabinoxylan. 

Second the combined actions of the enzyme according to the present invention with the 
endoxylanase is significantly higher than the sum of their individual action. Accordingly, 
the two enzymes affect each others enzymatic activities in a synergistic fashion. 

Induction of the AbfC gene: Identification of inducers 

The regulation of transcription of the AbfC encoding gene of Aspergillus niger was 
studied using a strain containing a fusion of the AbfC promoter to the ^-glucuronidase 
encoding gene (uid A) of E coli. 

GUS producing transformants were grown on different carbon sources and 'assayed both 
qualitatively and quantitatively for the ability to hydrolyse p-nitrophenol glucuronide. 



The results are shown below: 
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CARBON SOURCE GUS ACTIVITY AFTER 24 HOURS INDUCTION 
0%) (units/mg) 



xylose 


12.37 


xyfitol 


1.49 


arabinose 


6.66 


arabitol 


5.30 


glucose 


0.70 


cellubiose 


0.95 


xylo-oligomer 70 


17.26 


glucopyranoside 


0.40 


methyl-xylopyranoside 


24.20 


xyloglucan 


1.00 


pectin 


0.27 


arabinogalactan 


2.60 


arabitol 4- glucose 


2.20 



The results show that the AbfC promoter is switched on after 24 hours when grown in 
the presence of xylose, xylo-oligomer 70, methyl-xylopyranoside. arabinose and arabitol. 
These studies also suggest that methyl-xylopyranoside is the natural and strongest inducer 
of this promoter. 

The AbfC promoter is strongly repressed by glucose and is therefore under carbon 
catabolite repression. However, unlike all the published promoters for 
arabinofuranosidases, which are induced by arabinose and arabitol, the AbfC promoter 
of the present invention is regulated strongly by the intermediates in xylose metabolism. 
Accordingly, the present invention also covers an arabinofuranosidase promoter wherein 
the promoter is inducible by an intermediate in xylose metabolism. 
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Effects of different promoter deletions on the regulation of the expression of the 
AbfC gene 

To study the regulation at the molecular level, experiments were set up to detect possible 
upstream regulating sequences required for expression of the AbfC gene. A series of 
plasmids with deletions in the 5' upstream region of the gene was constructed (see Figure 
12). The E coli uid A gene was used as the reporter gene and a qualitative GUS assay 
was performed. 

The results indicated that the truncated AbfC promoter of 590 bp contains sufficient 
information for the inducibility of the AbfC gene and its regulation. Deletion of 100 bps 
sequence from the Xma 111 to the BamHl sites of the promoter led to a reduction in 
activity of this promoter. Therefore, this 100 bps area is important for good levels of 
gene expression. Deletion of 290 bps before the ATG identified this region to be 
important but not sufficient to abolish the activity of this promoter. All the transformants 
analysed containing this promoter construct showed very pale blue when tested (+- 
GUS). This region is as follows: 

-170 TCATCC AATAT 

As seen, this region contains the CCAAT element and is a putative target for a general 
transcriptional activator. This sequence is similar to the nuclear protein binding sites 
found in two starch inducible promoters: the Aspergillus mger glucoamylase gene and the 
Aspergillus oryzae amylase gene as well as the amdS gene of Aspergillus nidulans. 
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HETEROLOGOUS PR OTEIN PRODUCTION USING ASPERGILLUS NIGER 
TRANSFORMED WITH THE AbfC PROMOTER AND THE AbfC SIGNAL 
SEQUENCE 



Transformation of Aspergillus Niger 



The protocol for transformation of A. niger was based on the teachings of Buxton. F. P.. 
Gwynne D.I.. Davis, R.W. 1985 (Transformation of Aspergillus niger using ihe argB gene 
of Aspergillus nidulans. Gene 37:207-214), Daboussi.M. J. . Djeballi.A., Gerlinger, C. 
Blaiseau, PL., Cassan, M., Lebrun, M.H., Parisot, D., Brygoo.Y. 1989 
(Transformation of seven species of filamentous fungi using the nitrate reductase gene of 
Aspergillus nidulans. Curr. Genet. 15:453-456) and Punt, P.J., van den Hondel, 
C. A. M.J.J. 1992 (Transformation of filamentous fungi based on hygromycin B and 
Phleomycin resistance markers. Meth. Enzym. 216:447-457). 

For the purification of protoplasts, spores from one PDA (Potato Dextrose Agar - from 
Difco Lab. Detroit) plate of fresh sporulated N400 (CBS 120.49. Centraalbureau voor 
Schimmelculrures, Baam) (7 days old) are washed off in 5-10 ml water. A shake flask 
with 200 ml PDC (Potato Dextrose Broth, Difco 0549-17-9, Difco Lab. Detroit) is 
inoculated with this spore suspension and shaken (250 rpm) for 16-20 hours at 30 °C. 

The mycelium is harvested using Miracloth paper and 3-4 g wet mycelium are transferred 
to a sterile petri dish with 10 ml STC (1.2 M sorbitol, 10 mM Tris Hcl pH 7,5. 50 Mm 
CaCI 2 ) with 75 mg lysing enzymes (Sigma L-2265) and 4500 units lyticase (Sigma L- 
8012). 

The mycelium is incubated with the enzyme until the mycelium is degraded and the 
protoplasts are released. The degraded mycelium is then filtered through a sterile 60 nm 
mesh filter. The protoplasts are harvested by centrifugation 10 min at 2000 rpm in a 
swing out rotor. The supernatant is discarded and the pellet is dissolved in 8 ml 1.5 M 
MgSO„, and then centrifuged at 3000 rpm for 10 min. 
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The upper band, containing the protoplasts is transferred to another rube, using a transfer 
pipette and 2 ml 0.6 M KC1 is added. Carefully 5 ml 30% sucrose is added on the top 
and the rube is centrifuged 15 min at 3000 rpm. 

The protoplasts, lying in the interface band, are transferred to a new rube and diluted 
with 1 vol. STC. The solution is centrifuged 10 min at 3000 rpm. The pellet is washed 
twice with STC, and finally solubilized in 1 ml STC. The protoplasts are counted and 
eventually concentrated before transformation. 

For the transformation, 100 M l protoplast solution (10 6 -10 7 protoplasts) are mixed with 
10 M l DNA solution containing 5- 10 M g DNA and incubated 25 min at room 
temperature. Then 60 % PEG-4000 is carefully added in portions of 200 M l, 200 ^ and 
800 fxl. The mixture is incubated 20 min at room temperature. 3 ml STC is added to the 
mixture and carefully mixed. The mixture is centrifuged 3000 rpm for 10 min. 

The supernatant is removed and the protoplasts are solubilized in the remaining of the 
supernatant. 3-5 ml topagarose is added and the protoplasts are quickly spread on 
selective plates. 

AbfC promoter and heterologous gene expression 

The expression vector pXP-Amy (Figure 13) contains the 2.1 kb a-amylase encoding 
gene from Thermomyces lanuginosus cloned downstream of the AbfC promoter (2. 1 kb) 
and upstream of the Xylanase A terminator. This vector together with the hygromycm 
gene as a selectable marker was used for co-transformation experiments to test the 
functionality of the AbfC promoter. 

The best transformant was accumulated in shake flask experiments at least 1 gram per 
litre of a-amylase in the culture media. Starch degrading activity was then detected 
within 48 hours and a peak of enzyme activity is observed at 4 days of growth on sugar 
beet pulp and wheat bran (Figure 15). 
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AbfC signal sequence functions in protein secretion 

An expression construct containing the signal peptide of the AbfC gene translationally 
fused to the mature a-amylase from T. lanuginosus was prepared and expression of this 
construct in the production strains was observed. In this regard, the translational fusion 
construct pXPXss-Amy (Figure 14) was placed under the transcriptional control of the 
AbfC promoter and the xylanase A termination signal. The incorporation of an 
endogenous signal peptide resulted in increased detectabiiity of co-transformants 
expressing both amylase and the hygromycin resistance marker. The endogenous signal 
peptide directed the secretion of amylase out of the cell. 



Substrate Specificity of AbfC Protein 



The substrate specificity of the purified AbfC was determined using arabinose containing 
hemicelluloses: arabinoxylans from wheat, oat and larch, branched and debranched 
arabinans; arabinogalactan, sugar beet pectin, and xyloglucan. 

The HPLC and HP-TLC results are shown in Figure 16, in which the following 
abbreviations are used: WSP - water-soluble pentosan, WIP - water-insoluble pentosan, 
AG - arabinogalactan, deB-A - debranched arabinan. The standards used were: A- 
arabinose, X- xylose. 

The results indicate that arabinose is the hydrolysis product from arabinoxylans. No 
hydrolysis products were released from arabinogalactan, debranched arabinan or 
xyloglucan. Arabinose was released as a hydrolysis product from branched arabinan. 
AbfC is therefore a 1,2/1,2 debranching enzyme and it has no activity towards linear 1,5 
or-Iinked L-arabinofuranose residues found in debranched arabinans and arabinogalactan. 
This enzyme also releases a product when pectin is used as the substrate. It is believed 
that this product is an arabinose containing ferulic acid or an arabinobiose. 
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The results for the substrate specificity studies also suggest that the enzyme of the present 
invention couid be used to reduce the viscosity of feeds. In this regard, the enzyme 
would reduce the viscosity of branched substrates by removing the branches but not the 
backbone of that substrate. This is in contrast to the known viscosity modifiers which 
degrade the substrate backbone. 

Accordingly, the present invention covers a process of reducing the viscosity of a 
branched substrate wherein the enzyme degrades the branches of the substrate but not the 
backbone of the substrate. 

In particular, the present invention covers the use of the enzyme of the present invention 
as a viscosity modifier. 

In this regard, an experiment was carried out to investigate the reduction of viscosity of 
the water-soluble pentosan fraction from wheat flour by arabinofuranosidase. In this 
experiment, 6 ml water-soluble pentosan was incubated with 100 /xl of AbfC for 20 
hours, 20°C at pH 5.5. 

The results (see Figure 19) show that the enzyme of the present invention can be used 
to reduce the viscosity of pectins, especially pectins that are used in beverages - such as 
fruit juices. 

Accordingly, the present invention covers the use of the enzyme of the present invention 
to reduce the viscosity of pectin. 

ANTIBODY PRODUCTION 

Antibodies were raised against the enzyme of the present invention by injecting rabbits 
with the purified enzyme and isolating the immunoglobulins from antiserum according 
to procedures described according to N Harboe and A Ingild ("Immunization. Isolation 
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of Immunoglobulins, Estimation of Antibody Titre" In A Manual of Quantitative 
Immunoelectrophoresis, Methods and Applications, N H Axelsen. et al (eds.). 
Universitetsforlaget, Oslo, 1973) and by T G Cooper ("The Tools of Biochemistry" . John 
Wiley & Sons, New York, 1977). 

5 

SUMMARY 

Even though it is known that Aspergillus niger produces arabinofuranosidases, the present 
invention provides a novel and inventive arabinofuranosidase. as well as the coding 
10 sequence therefor and the promoter for that sequence. An important advantage of the 
present invention is that the enzyme can be produced in high amounts. 

In addition, the promoter and the regulatory sequences (such as the signal sequence and 
the terminator) can be used to express or can be used in the expression of GOIs in 
15 organisms, such as in A niger. 

The arabinofuranosidase of the present invention is different from the 
arabinofuranosidases previously known. In this regard, the previous described 
arabinofuranosidases - such as those of EP-A-0506190 - are characterised by their ability 
20 to degrade arabinan, and are assayed using p-nitrophenyl-arabinoside. 

The arabinofuranosidase of the present invention does not degrade arabinan, and only a 
minor activity is seen on p-nitrophenyl-arabinoside. 

25 In contrast, the arabinofuranosidase of the present invention is useful for degrading 
arabinoxylan. Therefore, the arabinofuranosidase of the present invention is quite 
different from the previous isolated arabinofuranosidases. 

More in particular, the enzyme of the present invention is capable of specifically cleaving 
30 arabinose from the xylose backbone of arabinoxylan. 
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Tbe enzyme of the present invention is useful as it can improve processes for preparing 
foodstuffs and feeds as well as the foodstuffs and feeds themselves. For example, the 
enzyme of the present invention may be added to animal feeds which are rich in 
arabinoxylans. When added to feeds (including silage) for monogastic animals (e.g. 
poultry or swine) which contain cereals such as barley, wheat, maize, rye or oats or 
cereal by-products such as wheat bran or maize bran, the enzyme significantly improves 
the break-down of plant cell walls which leads to better utilization of the plant nutrients 
by the animal. As a consequence, growth rate and/ or feed conversion are improved. 
Moreover, arabinoxylan-degrading enzymes may be used to reduce the viscosity of feeds 
containing arabinans. The arabinoxylan-degrading enzyme may be added beforehand to 
the feed or silage if pre-soaking or wet diets are preferred. 

Of particular benefit is the use of the enzyme according to the present invention in 
combination with a xylanase, especially an endoxylanase. 

A possible further application for the enzyme according to the present invention is in the 
pulp and paper industry. The application of xylanases is often reported to be beneficial 
in the removal of lignins and terpenoids from the cellulose and hemicellulose residues of 
a hemicellulose backbone, an essential step in the processing of wood, wood pulp or 
wood derivative product for the production of paper. The addition of arabinoxylan- 
degrading enzymes, produced according to the present invention, to the xylanase 
treatment step should assist in the degradation of an arabinan-containing hemicellulose 
backbone and thus facilitate an improved, more efficient removal of both lignins and 
terpenoids. The application of arabinoxylan-degrading enzymes should be particularly 
advantageous in the processing of soft woods in which the hemicellulose backbone 
contains glucuronic acid. 

The enzyme according to the present invention is also useful as it acts in a synergistic 
manner with endoxylanase (see results presented above). 

Other modifications of the present invention will be apparent to those skilled in the an 
without departing from the scope of the invention. 
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SEQUENCE LISTINGS 

SEO ID NO: i 
ENZYME SEQUENCE 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Lys Cys Ser Leu Pro Ser 
1 5 

Ser Tyr Ser Trp Ser Ser Thr Asp Ala Leu Ala Thr Pro Lys Ser Gly 

10 15 20 

Trp Thr Ala Leu Lys Asp Phe Thr Asp Val Val Ser Asp Gly Lys His 

25 30 35 

He Val Tyr Ala Ser Thr Thr Asp 61 u Ala Gly Asn Tyr Gly Ser Met 

40 45 50 

Thr Phe Gly Ala Phe Ser Glu Trp Ser Asn Met Ala Ser Ala Ser Lys 
55 60 65 70 

Thr Ala Thr Pro Tyr Asn Ala Val Ala Pro Thr Leu Phe Tyr Phe Lys 

75 80 85 

Pro Lys Ser lie Trp Val Leu Ala Tyr Gin Trp Gly Ser Ser Thr Phe 

90 95 ioo 

Thr Tyr Arg Thr Ser Gin Asp Pro Thr Asn Val Asn Gly Trp Ser Ser 

105 HO H5 

Glu Lys Ala Leu Phe Thr Gly Lys Leu Ser Asp Ser Ser Thr Gly Ala 

120 _ 125 130 

He Asp Gin ihr Val He Gly Asp Asp Thr Asn Met Tyr Leu Phe Ph* 
135 i4 ° 145 156 

Ala Gly Asp Asn Gly Lys He Tyr Arg Ser Ser Met Ser He Asp Glu 
155 160 ' 165 
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Phe Pro Gly Ser Phe Gly Ser Gin Tyr 61 u Glu He Leu Ser Gly Ala 

170 175 180 

Thr Asn Asp Leu Phe Glu Ala Val Gin Val Tyr Thr Va 1 Asp Gly Gly 

185 190 195 

Glu Gly Asn Ser Lys Tyr Leu Met He Val Glu Ala He Gly Ser Thr 

200 205 210 

Gly His Arg Tyr Phe Arg Ser Phe Thr Ala Ser Ser Leu Gly Gly Glu 
215 220 225 230 

Trp Thr Ala Gin Ala Ala Ser Glu Asp Lys Pro Phe Ala Ala Lys Pro 

235 240 245 

Thr Val Ala Pro Pro Gly Pro Lys Thr Leu Ala Met Val Thr Trp Phe 

250 255 260 

Ala Thr Thr Leu He Lys Pro * 
265 270 
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SEQ ID NO: 2 

NUCLEOTIDE CODING SEQUENCE 



AAA TGC TCT CTT 
GCA ACT CCT AAG 
GTC TCT GAC GGC 
GGA AAC TAT GGC 
ATG GCA TCT GCT 
ACC CTG TTC TAC 
TGG GGC TCC AGC 
GTC AAC GGC TGG 
GAC TCA AGC ACC 
AAT ATG TAT CTC 
AGC ATG TCC ATC 
GAA ATT CTG AGT 
TAC ACG GTT GAC 
GAG GCG ATC GGG 
AGC AGT CTC GGT 
CCC TTC GCA GCA 
GCC ATG GTG ACT 



CCA 


TCG 


TCC 


TAT 


TCA 


GGA 


TGG 


ACC 


AAA 


CAT 


ATC 


GTC 


TCG 


ATG 


^\-\^ 




AGC 


AAG 


ACA 


GCC 


TTC 


AAG 


CCG 


AAA 


A.CA 


TTC 


ACC 


TAC 


TCG 


TCG 


GAG 


AAG 


GGT 


GCC 


ATT 


GAC 


TTC 


TTT 






GAT GAA 


TTT 


CCC 


GGT 


GCC 


ACC 


AAC 


GGC 


GGC 


GAG 


GGC 


TCC 


ACT 


GGA 


CAT 


GGA GAG 


TGG 


ACA 


AAG 


CCA 


ACA 


GTG 


TGG 


TTC 


GCA 


ACA 



AGT TGG AGT TCA 
GCA CTG AAG GAC 
TAT GCG TCC ACT 
GGC GCT TTC TCA 
ACC CCC TAC AAT 
AGC ATC TGG GTT 
CGC ACC TCC CAA 
GCG CTT TTC ACC 
CAG ACG GTG ATT 
GAC AAC GGC AAG 
GGA AGC TTC GGC 
GAC CTA TTC GAG 
AAC AGC AAG TAC 
CGT TAT TTC CGC 
GCC CAG GCG GCA 
GCG CCA CCT GGA 
ACC CTG ATC AAA 



ACC 


GAT 


GCT 


CTC 




ACT 


GAT 


GTT 


ACT 


GAT 


GAA 


111 Li 


GAG 


TGG 


T~G 


AA<^ 


GCC 


GTG 


GCT 


CCT 


CTG 


GCC 


TAC 


CAA 


GAT 


CCC 


ACC 


AAT 


GGA 


AAA 


CTC 




GGC 


GAC 


GAT 


ACG 


A 1 L 


TAT 

1 At 


CuA 


1 


AGC 


CAG 


TAC 


GAG 


GCG 


GTC 


CAA 


GTG 


CTC 


ATG 


ATC 


GTT 


TCC 


TTC 


ACG 


GCC 


AGT 


GAG 


GAT 


AAA 


CCG 


AAG 


ACA 


TTA 


CCA 


TGA 
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SEO ID NO: 3 
PROMOTER SEQUENCE 

CTGCAGAAGA TGGCAGTCGC CACAGCCGAT CACCCGATCC ATACTGGATG TTGTAACTTG 60 
GAGACAGCCT GCAGA7GCTC TGATGAAGGT CTGCAAATA.G TTCCTGGACC TCGATAGTGA 120 



180 



AGTATACCGA TTCGTCAATG TTGTATATCC AGCCACTTTG AAAGTACCAA CTTTTAGTTC 

GATTGATCAG AATACTTTTG GTGTGTAACA TTGACAAGCC AAATTATCAA. TCTCT7CTAC 240 

CGGTAAGGTG TCAACTACCC GGCCGAAAGT ACCGGAAGGT CGTGGTGTTT TAAGGTGAAA 300 

CAACTATCAG GGCGGCAATG TGTCAAAGTA GAACCAGTTT GCTTAGCGCC ATTAGGATCC 360 

ACGCCTAGAC CCTTGATGCC CGGGAGTTAT CCGTCCTGTC ACAGCAATTA TTTCCCCGAG 420 

TCTACTGCCG AAGAACAGCC ATTGTGGCGT ACTCACGGAA TTACCCACTG TGTAGGGTAG 480 

TCTTGAACGC CGTTCTAGAC ACGGCAACGC TCCGGTGGAC GATCGTTTCT GGCTAATGTA 540 

CTCCGiAGTT TAGGCAGCAT GCTGATCATC TTCCCCCTAG GGAAAGGCCC CTGAATAGTG 600 

CGCCAAAATG AGCTTGAGCA AAGGAATGTT CTTTCTAAGC CAAAGTGAGG GAAATAACCA 660 

AGCAGCCCAC TTTTATCCGA AACGTTTCTG GTGTCATCCA ATATGGATAA ATCCCGATTG 720 

TTCTTCTGCA CATATCTCTA TTGTCATAAG TGCAACTACA TATATTTGAA CATGGTTTGG 780 
TCCTCTTTCC AAGTTATTCG TTCTCCGTGA CCAGCGATTT CAGCCATTGA TTCTTTTGTT 
TCTTTCCCCG CGGATAAACT CATACGAAG 



840 
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INFORMATION FOR SEO ID NO: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N- terminal 
(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Cys Ser Leu Pro Ser Ser Tyr Ser Trp Ser Ser Thr Asd Ala Leu 
1 5 10 15 

Ala Thr Pro Lys 
20 

INFORMATION FOR SEQ ID NO: 5: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 5: 

Tyr Leu Met He Va 1 Glu Ala He Gly Ser Thr Gly His Arg Tyr Phe 

1 5 10 15 

Arg Ser Phe Thr Ala Ser Ser Leu Gly Gly Glu Met Thr Ala Gin Ala 

20 25 30 

Ala Ser Glu Asp Lys Pro Phe Xaa Gly 
35 40 
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INFORMATION FOR SEO ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser He Trp Val Leu Ala Tyr Gin Trp Gly Ser Ser Thr Phe Thr Tyr 
1 5 10 15 

Arg Thr Ser Gin Asp Pro Thr Asn Val 
20 25 

INFORMATION FOR SEQ ID NO: 7: 

(1 ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 amino acids 
(8) TYPE: amino acid 
(C) STRANDEDNESS: single 



(D) TOPOLOGY 
(ii) MOLECULE TYPE 
(v) FRAGMENT TYPE 



1 i near 
peptide 
internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp He Val Tyr Ala Ser Thr Thr Asp Glu Ala Gly Asn Tyr Gly Ser 
1-5 10 15' 

Met Thr Phe Gly Ala Phe Ser Glu Xaa Ser Asn Met Ala Ser 
20 25 30 



WO 96/29416 



PCT/EP96/01009 



51 

INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEO ID NO: 8: 
He Tyr Arg Ser Ser Met Ser lie Asp Glu Phe Pro Gly Ser Phe Gly 
15 10 15 

Ser Gin Tyr Glu Glu He Leu Ser Gly Ala Thr Asn Asp Leu Phe Glu 

20 25 30 

Ala Val Gin Val Tyr Thr Val Asp Gly 
35 40 



INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "OLIGONUCLEOTIDE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCYTGNGCNG TCATYTC 17 

INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "OLIGONUCLEOTIDE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATG ATH GTN GAR GCN ATH GG 20 
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INFORMATION FOR SEQ ID NO: ii: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR fragment" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGATTGTGG AGGCGATCGG GTCCACTGGA CATCGTTATT TCCGCTCCTT CACGGCCAGC 60 
AGTCTCGGTG GAGAGATGAC CGCACAGGC 89 




PCT/EP96/01009 



INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: 3M43 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 870. .1757 
(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 870. .947 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION :948. .1754 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTGCAGAAGA TGGCAGTCGC CACAGCCGAT CACCCGATCC ATACTGGATG TTGTAACTTG 60 

GAGACAGCCT GCAGATGCTC TGATGAAGGT CTGCAAATAG TTCCTGGACC TCGATAGTGA 120 

AGTATACCGA TTCGTCAATG TTGTATATCC AGCCACTTTG AAAGTACCAA CTTTTAGTTC 180 

GATTGATCAG AATACTTTTG GTGTGTAACA TTGACAAGCC AAATTATCAA TCTCTTCTAC 240 
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CGGTAAGGTG TCAACTACCC GGCCGAAAGT ACCGGAAGGT CGTGGTGTTT TAAGGTGAAA 300 
CAACTATCAG GGCGGCAATG 7GTCAAAGTA GAA.CCAGTTT GCTTAGCGCC A7TAGGATCC 360 
ACGCCTAGAC CCTTGATGCC CGGGAGTTAT CCGTCCTGTC ACAGCAATTA TTTCCCCGAG 420 
TCTACTGCCG AAGAACAGCC ATTGTGGCGT ACTCACGGAA TTACCCACTG TGTAGGGTAG 480 
TCTTGAACGC CGTTCTAGAC ACGGCAA.CGC TCCGGTGGAC GATCGTTTCT GGCTAATG7A 540 
CTCCGTAGTT TAGGCAGCAT GCTGATCATC TTCCCCCTAG GGAAAGGCCC CTGAATAGTG 6C0 
CGCCAAA.ATG AGCTTGAGCA AAGGAATGTT CTTTCTAAGC CAAAGTGAGG GAAATAACCA 660 
AGCAGCCCAC TTTTATCCGA AACGTTTCTG GTGTCATCCA ATATGGATAA. ATCCCGATTG 720 
TTCTTCTGCA CATATCTCTA TTGTCATAAG TGCAACTACA TATATTTGAA CATGGTTTGG 780 
TCCTCTT7CC AAGTTATTCG TTCTCCGTGA CCAGCGATTT CAGCCA7TGA TTC7TTTG7T 840 
TCTTTCCCCG CGGATAAACT CATACGAAG ATG AAG TTC TTC AAT GCC AAA GGC 893 

Met Lys Phe Phe Asn Ala Lys Gly 
-26 -25 -20 
AGC TTG CTG TCA TCA GGA ATC TAC CTC ATT GCA TTA ACC CCC TTT GTT 941 
Ser Leu Leu Ser Ser Gly He Tyr Leu He Ala Leu Thr Pro Phe Val 

-15 -10 ' -5 

AAC GCC AAA TGC TCT CTT CCA TCG TCC TAT AGT TGG AGT TCA ACC GAT 989 
Asn Ala Lys Cys Ser Leu Pro Ser Ser Tyr Ser Trp Ser Ser Thr Asp 

1 5 10 

GCT CTC GCA ACT CCT AAG TCA GGA TGG ACC GCA CTG AAG GAC TTT ACT 1037 
Ala Leu Ala Thr Pro Lys Ser Gly Trp Thr Ala Leu Lys Asp Phe Thr 
15 20 25 30 

GAT GTT GTC TCT GAC GGC AAA CAT ATC GTC TAT GCG TCC ACT ACT GAT 1085 
Asp Val Val Ser Asp Gly Lys Hts He Val Tyr Ala Ser Thr Thr Asp 

35 40 45 

GAA GCG GGA AAC TAT GGC TCG ATG ACC TTT GGC GCT TTC TCA GAG TGG 1133 
Glu Ala Gly Asn Tyr Gly Ser Met Thr Phe Gly Ala Phe Ser Glu Trp 

50 55 60 

TCG AAC ATG GCA TCT GCT AGC AAG ACA GCC ACC CCC TAC AA.T GCC GTG 1181 
Ser Asn Met Ala Ser Ala Ser Lys Thr Ala Thr Pro Tyr Asn Ala Val 

65 70 75 

GCT CCT ACC CTG TTC TAC TTC AAG CCG AAA AGC ATC TGG GTT CTG GCC 1229 
Ala Pro Thr Leu Phe Tyr Phe Lys Pro Lys Ser He Trp Val Leu Ala 

80 85 90 

TAC CAA TGG GGC TCC AGC ACA TTC ACC TAC CGC ACC TCC CAA GAT CCC 1277 
Tyr Gin Trp Gly Ser Ser Thr Phe Thr Tyr Arg Thr Ser Gin Asp Pro 
95 100 105 110 
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ACC AAT GTC AAC GGC TGG ICG TCG GAG AAG GCG CTT TTC ACC GGA AAA 1325 
Thr Asn Val Asn Gly Trp Ser Ser Glu Lys Ala Leu Phe Thr Gly Lys 

115 120 125 

CTC AGC GAC TCA AGC ACC GGT GCC ATT GAC CAG ACG GTG ATT GGC GAC 1372 
Leu Ser Asp Ser Ser Thr Gly Ala He Asp Gin Thr Val ile Gly Asp 

130 135 140 

GAT ACG AAT ATG TAT CTC TTC TTT GCT GGC GAC AAC GGC AAG ATC TAC 1421 
Asp Thr Asn Met Tyr Leu Phe Phe Ala Gly Asp Asn Gly Lys ile Tyr 

145 150 155 

CGA TCC AGC ATG TCC ATC GAT GAA TTT CCC GGA AGC TTC GGC AGC CAG 1469 
Arg Se^ Ser Met Ser Ile Asp Glu Phe Pro Gly Ser Phe Gly Ser Gin 

160 165 170 

TAC GAG GAA ATT CTG AGT GGT GCC ACC AA.C GAC CTA TTC GAG GCG GTC 1517 
Tyr Glu Glu Ile Leu Ser Gly Ala Thr Asn Asp Leu Phe Glu Ala Val 
175 180 185 190 

CAA GTG TAC ACG GTT GAC GGC GGC GAG GGC AAC AGC AAG TAC CTC ATG 1565 
Gin Val Tyr Thr Val Asp Gly Gly Glu Gly Asn Ser Lys Tyr Leu Met 

195 200 205 

ATC GTT GAG GCG ATC GGG TCC ACT GGA CAT CGT TAT TTC CGC TCC TTC 1613 
Ile Val Glu Ala He Gly Ser Thr Gly His Arg Tyr Phe Arg Ser Phe 

210 215 220 

ACG GCC AGC AGT CTC GGT GGA GAG TGG ACA GCC CAG GCG GCA AGT GAG 1661 
Thr Ala Ser Ser Leu Gly Gly Glu Trp Thr Ala Gin Ala Ala Ser Glu 

225 230 235 

GAT AAA CCC TTC GCA GCA AAG CCA ACA GTG GCG CCA CCT GGA CCG AAG 1709 
Asp Lys Pro Phe Ala Ala Lys Pro Thr Val Ala Pro Pro Gly Pro Lys 

240 245 250 

ACA TTA GCC ATG GTG ACT TGG TTC GCA ACA ACC CTG ATC AAA CCA TGA 1757 
Thr Leu Ala Met Val Thr Trp Phe Ala Thr Thr Leu He Lys Pro * 
255 260 265 270 

CTGTCGATCC TTGCAACCTC CAGTTGCTCT ATCAGGGCCA TGACCCCCAA CAGCAGTGGC 1817 
GACTACAACC TCTTGCCATG GAAGCCGGGC GTCCTTACCT TGAAGCAGTG ACGAGCTTAT 1877 
CTTTAGTTGC AGATCGTGTT TCTCCTTTCT TCTTCAAGTA GTTTTAGTGG TGGAAGACAG 1927 
CAGAAGGTGG TCATCATCTT AGGCTCAGTT GGGGTGGGCT CCTGCCACGT TTTGTCCATA 1997 
GGCTAGTAAT TTGCACGGAA TTCAGTTCAT TGGCAAGGAG TGCGGTACGA ATACCTGTTT 2057 
TCACAATAGC AATTAGGCCC AGTAGTTATA CTACGTACTG GAATTGAGTA CTCGTAGTAG 2117 
CAAGATTGTT TGCCTCAGAG GGAATGGCCG ACACGTGAGC AAGTCACCTT CATCAGCTAG 2177 



WO 96/29416 




PCT/EP96/01009 



TCGCGT7CCA CATAGACAAT GGTCCAGCTC CAGAGTGGAA TTTGGGCTAC TTTGAACGAT 2237 

GGCCGATTGA ATCGCGCGTC TCCTCAAT7G TATTTAACCA CAATAGGCCA GGTATTGGCA 2297 

TTCACTCTCC GCCTTTGCGG GTGCCGGCAC GAGATGTCTC CTGAAGAAA.C TAGGCAACGA 2357 

GCAGACTGTG GATATGGGAG A7GGTTGACG ATGTGCTTCT TGGTAAATTT GAAGCCTCCA 2417 

GGGCCTCTAG AAAGGCGGGA ATTTAAATCT CAAGTGCCCT AACGTGTCCG ACCACGGTGT 2477 

TGATCATCAT TCATTGAATC GGATAACAGT CTTGGTTCGG AAACTGAACA GGCGGCTCTT 2537 

GAATGACACT CTGGATCC 2555 



(2) INFORMATION FOR SEQ ID NO: 13: 
TERMINATOR SEQUENCE 

CTGTCGATCC TTGCAACCTC CAGTTGCTCT ATCAGGGCCA TGACCCCCAA CAGCAGTGGC 60 

GACTACAACC TCTTGCCATG GAAGCCGGGC GTCCTTACCT TGAAGCAGTG ACGAGCTTAT 120 

CTTTAGTTGC AGATCGTGTT TCTCCTTTCT TCTTCAAGTA GTTTTAGTGG TGGAAGACAG 180 

CAGAAGGTGG TCATCATCTT AGGCTCAGTT GGGGTGGGCT CCTGCCACGT TT7GTCCATA 240 

GGCTAGTAAT TTGCACGGAA TTCAGTTCAT TGGCAAGGAG TGCGGTACGA ATACCTGTTT 300 

TCACAATAGC AATTAGGCCC AGTAGTTATA CTACGTACTG GAATTGAGTA CTCGTAGTAG 360 

CAAGATTGTT TGCCTCAGAG GGAATGGCCG ACACGTGAGC AAGTCACCTT CATCAGCTAG 420 

TCGCGTTCCA CATAGACAAT GGTCCAGCTC CAGAGTGGAA TTTGGGCTAC TTTGAACGAT 480 

GGCCGATTGA ATCGCGCGTC TCC7CAATTG TATTTAACCA CAATAGGCCA GGTATTGGCA 540 

TTCACTCTCC GCCTTTGCGG GTGCCGGCAC GAGATGTCTC CTGAAGAAAC TAGGCAACGA 600 

GCAGACTGTG GATATGGGAG ATGGTTGACG ATGTGCTTCT TGGTAAATTT GAAGCCTCCA 660 

GGGCCTCTAG AAAGGCGGGA ATTTAAATCT CAAGtGCCCT AACGTGTCCG ACCACGGTGT 720 

TGATCATCAT TCATTGAATC GGATAACAGT CTTGGTTCGG AAACTGAACA GGCGGCTCTT 780 

GAATGACACT CTGGATCC 798 



(2) INFORMATION FOR SEQ ID NO: 14 
Signal SEQUENCE 

ATG AAG TTC TTC AAT GCC AAA GGC AGC TTG CTG TCA TCA GGA ATC TAC 
CTC ATT GCA TTA ACC CCC TTT GTT AAC GCC 
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SEO ID NO: 15 

SIGNAL SEQUENCE 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 amino acids 
(8)' TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEO ID NO: 15: 



Met Lys Phe Phe Asn Ala 
Leu Ser Ser Gly He Tyr 
Thr Pro Phe Val Asn Ala 



Lys Gly Ser Leu 10 
Leu lie Ala Leu 20 

26 
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CLAIMS 



1. An enzyme that is obtainable from Aspergillus, wherein the enzyme has the 
following characteristics: 

a. a MW of 33,270 D ± 50 D 

b. a pi value of about 3.7 

c. arabinoxylan degrading activity 

d. a pH optima of from about 2.5 to about 7.0 (more especially from about 
3.3 to about 4.6, more especially about 4) 

e. a temperature optima of from about 40°C to about 60°C (more especially 
from about 45°C to about 55°C, more especially about 50°C); 

wherein the enzyme is capable of cleaving arabinose from the xylose backbone of an 
arabinoxylan. 

2. An enzyme having the sequence shown as SEQ. I D. No. 1 or a variant, 
homologue or fragment thereof. 

3. An enzyme coded by the nucleotide sequence shown as SEQ. I. D. No. 2 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 

4. A nucleotide sequence coding for the enzyme according to claim 1. 

5. A nucleotide sequence coding for the enzyme according to claim 2. 

6. A nucleotide sequence having the sequence shown as SEQ. LD. No. 2 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 



7. A nucleotide sequence according to any one of claims 4 to 6 operatively linked 
to a promoter. 
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8. A nucleotide sequence according to claim 7 wherein the promoter is the promoter 
having the sequence shown as SEQ. I.D. No. 3 or a variant, homologue or fragment 
thereof or a sequence complementary thereto. 

9. A promoter having the sequence shown as SEQ. I.D. No. 3 or a variant, 
homologue or fragment thereof or a sequence complementary thereto. 

10. A promoter according to claim 9 operatively linked to a GOI. 

11. A promoter according to claim 10 wherein the promoter is operatively linked to 
a GOI, wherein the GOI comprises a nucleotide sequence according to any one of claims 
4-6. 

12. A terminator having the nucleotide sequence shown as SEQ. I.D. No. 13 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 

13. A signal sequence having the nucleotide sequence shown as SEQ. LD. No. 14 or 
a variant, homologue or fragment thereof or a sequence complementary thereto. 

14. A construct comprising or expressing the invention according to any one of claims 
1 to 13. 

15. A vector comprising or expressing the invention of any one of claims 1 to 14. 

16. A plasmid comprising or expressing the invention of any one of claims 1 to 15. 

17. A transgenic organism comprising or expressing the invention according to any 
one of claims 1 to 16. 



18. A transgenic organism according to claim 17 wherein the organism is a fungus. 
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19. A transgenic organism according to claim 18 wherein the organism is a 
filamentous fungus, preferably of the genus Aspergillus. 

20. A transgenic organism according to claim 17 wherein the organism is a plant. 

21. A process of preparing an enzyme according to any one of claims 1 to 3 
comprising expressing a nucleotide sequence according to any one of claims 4-8. 

22. A process according to claim 21 wherein the enzyme has the sequence shown as 
SEQ. I.D. No. 1 or a variant, homologue or fragment thereof, and the nucleotide 
sequence has the sequence shown as SEQ. I.D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

23. A process according to claim 21 or claim 22 wherein the expression is controlled 
(partially or completely) by use of a promoter according to claim 9. 

24. A process for expressing a GOI by use of a promoter, wherein the promoter is 
the promoter according to claim 9. 

25. Use of an enzyme according to any one of claims 1 to 3 or prepared by a process 
according to any one of claims 21 to 24 to degrade an arabinoxylan. 

26. Use according to claim 24 wherein the enzyme is used in combination with a 
xylanase, preferably an endoxylanase. 

27. A combination of enzymes to degrade an arabinoxylan. the 'combination 
comprising an enzyme according to any one of claims 1 to 3 or prepared by a process 
according to any one of claims 21 to 24 claims; and a xylanase. 



28. Plasmid NCIMB 40703, or a nucleotide sequence obtainable therefrom for 
expressing an enzyme capable of degrading arabinoxylan or for controlling the expression 
thereof or for controlling the expression of another GOI. 
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29. A signal sequence having the sequence shown as SEQ. I.D. No. 15 or a variant, 
homologue or fragment thereof. 

30. The use of the enzyme according to any one of claims 1 to 3 or prepared by a 
5 process according to any one of claims 21 to 24 claims, in the manufacture of a 

medicament or foodstuff to reduce or prevent indigestion and/or increase nutrient 
absorption. 



10 



31. An arabinofuranosidase enzyme having arabinoxylan degrading activity, which is 
immunologically reactive with an antibody raised against a purified arabinofuranosidase 
enzyme having the sequence shown as SEQ. I.D. No. 1. 
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AMY 637 PROMOTER 

SEQUENCE TYPE: Nucleotide 

MOLECULE TYPE: DNA 

ORIGINAL SOURCE: Solanum Tuberosum 

SEQUENCE LENGTH: 2094 

SEQUENCE: 

10 _ _ _20 30 _ 40 

ATTAAGGGGA GCA7AA6 i GC AGC7CAGAAA i ; CACACCTG 

50 60 70 30 

ATATTTTCCC AAAGCCCTCA AAAATGTGAA CAAATCTGCT 

_ _ 90 100 110 120 

AAAAiGiCAG TCAGAAGGAC TGTTCTTTTA GGTTTTCTTC 

130 140 150 160 

iCTCGAGTCA CGAAA.ICAGA TAATATGATA AGAAATTATG 

170 180 190 200 

GAGGATTTAT AATGTATCTG TCTGTTCTTA GGTATAATTA 

210 220 230 240 

TGTGTTCCTT TATGATGTAG TAATGGAATT CTGGGCTTAT 

250 260 270 280 

ATTAAAGGAA CTGAATATAA ATGTTCGCAT TTTAACTGCG 

290 300 310 320 

GAGACTTCGA GTTA6AGCCT TATAATTATG TCTTATCATT 

330 340 350 360 

TTATACTGAG ATCATATTAC AGATGATGAA AGCTGACATT 

370 380 390 400 

GCATTAGTTA TTCTGTTTTA TACAAGTCAT GTAACTGCTG 

410 420 430 440 

CTTGTGAGTT GTGACTGTAA GATAAATTGA TTCAGCCTTC 

450 460 470 480 

TGTGGCATTA GCGGAGATCT GATTATACTC TCATCGTCTT 

490 500 510 520 

ATCTAAGTTG CTCATGCAAC TTTGTCCTTG ATAGTTGGCT 

530 540 550 560 

AATACTACAA CTGGAATTAA GTGTAGTTAT TCGAAATCTC 

570 580 590 600 

TGTTGGAAGT TGCTAAGTGC TTAAGTGCTG GTTATTGTAA 

610 620 630 640 

ACCCCATCCG AGTTATTATA CAGCATCTGG CTGATGAAA.T 

650 660 670 680 

GCTGCTCATT TGCAATGGTG ACATAACCAA. ATGTTAGTAA 

690 700 710 720 

AACATACTAG CTGGTTGAAT GTTAGATGAT TGTTCAACGT 

730 740 750 760 

TACATCTCAC AGAAACCTTA TTATGGATTG ACATGTTAGT 
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FIGURE i CONTINUED 



2/38 



TGA1 



770 

:gam 

810 

\ i : ; r \r 
-. I : I omO 

850 
ICCATTC 
890 



gatc; 



A i Au i i AGAA 
930 

AGAo i uAAA i 
970 

CAl IGAGACA 

1010 
CTGTAGCATT 

1050 
TCTTATTGTT 

1090 

GTTTTTATAT 

1130 
ATGTGTACGA 

1170 
TTATGATTAT 

1210 
GTCGTGTAAT 

1250 
TCAACCTTCT 

1290 
CTTTATTTAT 

1330 
AGCTAATGAT 

1370 
GCTTTCTAAA 

1410 
GAATATTGTT 

1450 
GAGTCTACTG 

1490 
TCAACAACAA 

1530 
CTTTCTAATT 

1570 
GGTCGTCTTC 

1610 
TGGTCGTCCC 

1650 
GTGAAAACAT 



UMl 



GT 



730 
TCTT 

320 
TTTT 

360 



^790 

; mAATlsClA 
830 



~. l i i i i 

900 

ATGTGCG f AT 
9d0 

GATTATATAG 

980 

TCTG i GATTG 
1020 
AGAAACTATA 
1060 
GTGCTCAAA.G 
1100 
GCTCTTCCAT 
1140 
GCTTATAATA 
1180 
GTCCTCCATT 
1220 
TTGCTGTTTG 
1260 
GCGGTGTTGG 
1300 
CATAGTAGCA 
1340 
AAGCCATTGA 
1380 
ATGAATCTAC 
1420 
GATACTTCTG 
1460 
AAGC 
1500 



i i 



CATATTCAGT 

1540 
TTCGATTATG 

1580 
I I I I I i I CAT 

1620 
CACCATTCAG 

1660 
CCACATTTTT 



i uAGCC i CAC 
910 

ATATATAGAG 
950 

TCGAAGAi .A 
990 



_ 800 
AAGC ! ! GTTA 
840 

Ui ini ATC i A 

380 

AGA I u i i u : C 

92G 

JSC 

LuAAAC t I GA 



1030 
G i i A 
1070 
CAACTTCATC 
1110 
TATCACCGAA 
1150 
TTACTGATGG 
1190 
AA.TTATTCTG 
1230 
TGATTGTACG 
1270 
TTGAAGTTCA 
1310 
TTTGATTATT 
1350 
AGGGAAGCAG 
1390 
GAATGGATGA 
1430 
CAATCAGATT 
1470 
CTGTTTCAGA 
1510 
GTAGTAGACA 
1550 
CACCCTCTTT 
1590 
GATGTCACTG 
1630 
GAAGTCACTT 
1670 
CAAATCCAGC 



_1000 
A ! GTA i ATAT 
1040 
GC i i CACTTG 
1080 
ATACAGTATG 
1120 
CCTTATGATT 
1160 
TGATTCAGTA 
1200 
TT7CATACAA 
1240 
ATAAATTGAT 
1280 
AGTAAATTAG 
1320 
GATGCTCTGT 
1360 
AAATGGTAAA 
1400 
TAAAGTTAAT 
1440 
ATGAGTTACT 
1480 
TGATCGATCA 
1520 
TGATCGATCA 
1560 
TCTCCAATTT 
1600 
AATTATTCTC 
1640 
CGAGCATAAT 
1680 
AGAATTTTCA 
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FIGURE i CONTINUED 

1690 _ 1700 1710 1720 

I lAAACGGGG i iCAACATTT ACTACATGTA TACACTC7GA 

1/30 1740 1750 l^n 

AG7C76AA.7C CACTAAHCT AGATGGTGCA 7CTGTGCCCC 
,, r , r _j^0 „ I 780 1790 1800 

umLAL, bi,j mAAGv.ii Am CTCAATTTTT 7AT7T7CCAA 
r , * 8 2 , r 1320 1330 T840 

CAAu^mA., i CAGACCACA CAACTCCCGT G7C77GTACG 

U ' CAGC fI£ GAG7GGAG ^ C7CAA7TAAG TG AC TTTAAC 

1890 1900 1910 iQpn 

GTCGAG77CT ATAGTAAACA ACCCCTATA? CTTTTTTCM 

1930 1940 1950 I9fin 

bwATGTTAA.G ATTGCGAACA CACTGAAATT TCCAGGTCGT 
„ _ 1970 1980 1990 ?nnn 

TAATCTTGTA CCCAGTGTGT GTACTTTTAA AAAAAAAAGT 
r- -Q1 Q 2020 2030 2040 

cAGTTTTTTA GTCTCTAAAA CACATTTAAA TAGAGTTTAT 
^ 2050 2060 2070 ?n«n 

'"IS TTGTTCC7CA TACTASKTT CGGAGTCAAC 
ACAACACAAC MCA 
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IGURE 2 

AMY 351 PROMOTER 
SEQUENCE iYPE: Nucleoli- 0 
MOLECULE TYPE: DNA fcenomiM 
9*19™& S ? UR i- r : SolanuR zLoerosum 



? 34 &d 



SEQUENCE LENGTH 
57RANDEDNESS: Douci= 
; OPOLOGY: Linear 
SEQUENCE: 

*wGi. ^..-.ULi.-oA TTTTTC7TCT 7CAATC77C7 

D ° 60 70 an 

A i AT7TM77 CG7T7TAGC7 7CAAAC77C7 7CAAT7TTA7 

T-r f9 100 HO 120 

. :umm, i iA* i r.iALAAAA AAAA.7C7C7A TTTAGCACCA 
130 140 150 160 

AA.T 7CATGCTCAA AA.TGGGCAAA CATAAAT* 
MS _ I 80 190 ?Q0 



l i CATAAAA7 TCATGCTCAA AA.TGGGCAAA CA7AAATAAT 
i'U 180 190 ?oo 

AAATGTGAAG 7AAA7AA7GG ATTAAAATA7 ATATTTTTGG 
210 220 230 ?dn 

GCCTCACATC AACCTTCATA ATTCT7GAAT GAATGAATGA 

cbQ 260 270 280 

TAGACTTCAT .AA7777T7AA CCTATACATA TAAGAAAAT7 
290 300 3in -jjn 

GAGAG7AAC7 CAAA7AACAA G77G7AG7A7 CACATCTTTA 
JJlJ 340 350 76n 

C i ATTTGATA ACA7TATGAA GGTGATTATA CATTACGTAA 
r. -r^ 370 380 390 d nn 

CAmCTTTT AAAAATATGT AAGCAAATTT AC7TT7TAAC 
-r 4 1° 4 20 430 a An 

TTATCAT7GA TCTTCATGGT TTTGTCATAA A7C7CAAAGT 
4bU 460 470 Afln 

IA7CA7ATTT TATATAGCTA TTTGAAAGTA A7TTTA7TTT 
4 90 500 510 q?n 

TAC7CA7CA7 TGAGTGATGC T7TTATTA7A ATACTAGTAA 

bJO 540 ccn 

GTTTTATTTA TTATTTTCTT 7TAGGGG7GA AT7GTATAAT 
570 530 SQO Ann 

ATAATAAAAA ATATATT7TT AGAAA7AATG ATTC7777A7 

7A77AAAAAG TTAAGATA7T AGATTATTTA 7GC7TG7A7A 
ooO 660 670 £an 

A I AA7GAACG AAG7777A7T 7TC7A7GAG7 nCATTAATC 
690 700 710 Ton 

ATGTTTGTAA T7ATTTCAAA T77TGATGTA TTTTTATAAT 

r 730 740 750 7fin 

i i i GTA7TA7 7A7ATTA77A 7AC7ATAT7T AAAAAT7TAA 
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■GURE 



"INUED 



AbMi^AG '^..mC^l ClACGTCAAG aggctgcgc 
.__^^2 10 °20 830 840 

LTTTcCtiAA ATTAAG7AAA ATT.-7Trcrr - r .-r r .~2rV 

.5?? SoO 870 880 



lulCTCCGCC TTTTAAAAr^ 

8?0 900 9io - 9?0 

uul ^ ^.iATTTA A7AAAAACAC 7TC~AATT" 

_ ?30 _ 940 o 50 

b i i ATAJG7 7CAA7TGAAC ATG7CCG7GA "7AGAAAATT 
_ 9/0 980 ago w, 

AAA, .AAAT7 CAATGACAAA 77T.AA7AA77 TGACACAAM 
lulu 1020 1010 in/in 

7TTATGAAAA AAATATCAAA ATA7AAAGAA ATATT7TTTT 
_i050 1060 1070 man 

iGAAAlGGAT 7AAAAAGAAA AAAAAAACAA ATAAATTGAA 
1090 uoo iiio ' 1 V5n 

tCuGuAT.AAG 77GG77G7TT AATTGA7TA7 TGAT7ATGA7 
liJU H40 lien ncn 

CTCAATTTGA CAT77TGCGC GATCTTTCGA CCTCAATTCG 
1170 H80 1190 i?nn 

7A7GAACTGA CACTACGCCA ATGGACAGTC GCCGTCGTCA 

CCGCCACCGC ACTATTCTCG ACGCGTCGTC TATCTCCTCC 
1"0 1260 1270 i?on 

ACCCCACAGC CGTCAATTCC AAGC7TCCAA 7GAACCGTTG 

l^yo 1300 mn loon 

CCATGTGTCA. C7GCCTATTC ACCGCGAAAC ATGAATATCA 

1340 1150 i ocn 

CTGACGAACG ATTTCGGAGC GGAACGAATC CAGAAAATGG 

ATTACT 1410 T CGAATCTCAA CTCWTTTGG 

7AAAAATAAA ATTAAAAATA TTG7TTCTTT 7TGTATTTCT 
1450 1460 1470 i4Rn 

S CTGGTT TATG 7GGTGATCGA ATTTTCAATT 
Ji?S ^ 1510 1520 



7-TTTTACTGG TAGTGATTCC TACTTTTCTT CAATTGCATT 
IsJO 1540 1550 i zcr\ 

.CTCCTTT7T CCAT7TCACG G7TGAGAATT CATGA77CCT 
1570 1580 1590 isnn 

TATCAGAGGA ATCGATCCGA 77TGACTAAT 7TCACT7TTC 

GTCTGTATAA ATACCAGAGT ATC7AGGTTG AGGAACGTM 
tt™ 1550 1660 1670 ififln 

7TTCAAGCTG CGATCGGCTT TTTCCCCTGA ACGAGCAAAC 
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FIGURE 2 CONTINUED 

J. 690 1700 1710 1720 

ACAGGiiGTG GGTTC3AGTT AGCAAGGGAC GTATAATCTC 
1730 

AAC7ACAATC CATT 
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nGUKE 3 

a -AMYLASE CODING SEQUENCE 
(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH. 2017 base oai.-s 

(3) TYPE: nucleic acic 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

l A) L-NGTH: 475 ammo acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS sinale 

(D) TOPOLOGY: linear ' 



A?I aiS Si? G , AT a? A ? CG I CG ™ T CAC GGT TAT TGG CAG AAA GAC CT 
Asp A.a Ala Asp Ala Tnr Ser Tyr His Gly Tyr Trp Gin Lvs Asd i «u 

7 ~ 80 85 



48 



96 



m,G .v..G rwi CiC GCC GCA ATT GCT GCT CTG CTG TCG CCC Ara CTG GTC 
Met Lys ber Leu Ala Ala He Ala Ala Leu Leu Ser Pro Thr L-u Vol 
-;8 -15 -io .5 ' 

CGG GCA GCG ACT CCG GAT GAG TGG AAA GCT CAG TCG ATC TAT TTC ATG 
Arg Ala Ala Tnr Pro Asp Glu Tr D Lys Ala Gin Ser Hp Tvr Phe Me^ 
1 5 10 

fI G ri G S AC ? G V 1 GCG CGT ACC GAC MT T ^ G ACC ACG GCT CCC TGT 144 
Leu Thr Asp Arg Phe Ala Arg Thr Asp Asn Ser Thr Thr Ala Pro Cys 

lb 20 25 30 

A?p Thr ThJ Ala C Gly GTATGCAACT ^CCCTGTGT TTCTCTTCCC GGGACGTACA 199 
35 

AGGGGTCTTC TCCATGCTAA CCGTGCACAT GCAG AAA TAT TGC GGG GGA ACA 251 

Lys Tyr Cys Gly Gly Thr 
40 

j'l Arg G G ly He' II S Z GTAAGTGGCT 'CTGATTTTC GCTCAATAAT 302 
45 

CTTCGTCGCG TGACTTTATT TCCTAG CTG GAT TAC ATC CAG GAT ATG GGC TTC 355 

Leu Asp Tyr He Gin Asd Met Gly Phe 
50 55 

S r G E!.l? A ! A A £T CCA GTG ACA GCC CAG TGG GAC GAC GAT GTG 403 
uihr rt lo lie irp ile ihrPro Val Thr Ala Gin Trp Asd A<d Asp Val 
oU 65 70 



450 

GTGCGCAACC CTGCTCCATG GATCGCTGGC TGCAAACTCG TGCTGATCGG TGATTTTTTT 510 

TTTT7TTTTT TTGAAACAG A TAC TCT CTG AAT TCG AAA TTC GGC ACT GCC 560 

Tyr Ser Leu Asn Ser Lys Phe Gly Thr Ala 
90 95 
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FIGURE 2 CONTINUED 

GAT GAC TTG .AAA GCC CTG GCT GAT GCC CTT CAC GCC CGT GGG ATG CT7 608 

Asd Asp _eu Lys Ala Leu Ala Aso Ala Leu His Ala Arq Glv Me - '< a u 
100 105 110 " 115 

CTC ATG 37C GAC GTC GTG GCT -AT CAC TTT GTACGGACCA TCTACATACC 558 
Leu Met Val Asp val Val Ala Asn His Prp 
120 125 

7GGGAAACGC GAAGAAGGAA AAAAAAAAAA AGGCGCACGC T.AACATTTCG CGTTTAG r 15 



GGC TAC GGC GGT TC; CAT AGC GAG GTG GAT TAC TCG ATC TTC AAT CCT 763 
Gly lyr Gly Gly Se r His 5er Glu Va 1 Asd ivr Ser 11= D he *sn P-c 
130 135 * 140 

CTG AAC AGC CAG GAT TAC TTC CAC CCG TTC TGT CTC ATT GAG GAC TAC 811 
Leu Asn Ser Gin Asp Tvr Phe His Pro Phe Cys Leu He Glu Asp Tvr 
145 150 155 ' 

GAC AAC CAG GAA GAA GTC GAA CAA TGC TGG CTG GCC GAT ACT CCG ACG 859 
Asp Asn Gin Glu Glu Val Glu Gin Cys Trp Leu Ala Asp Thr Pro Thr 
160 165 170 

ACA TTG CCC GAC GTG GAC ACC ACC AAT CCT CAG GTT CGG ACG TTT TTC 907 
Thr Leu Pro Asp Val Asp Thr Thr Asn Pro Gin Val Arc Thr Phe Phe 
175 180 185 

AAC GAC TGG ATC AAG AGC CTG GTG GCG AAC TAC TCC A GTATGATTGT 954 
Asn Asp Trp He Lys Ser Leu Val Ala Asn Tvr Ser 
190 195 200 

TCCCGCGGTA ACGCTTTAGG GCTTGCTCTA ACTGAAATCG ACAG TC GAT GGT CTG 1009 

He Asp Gly Leu 
205 

CGC GTC GAC ACC GTT AAG CAC GTG GAG AAA GAT TTC TGG CCC GAC TTr 1057 
Arg Val Asp Thr Val Lys His Val Glu Lys Asp Phe Trp Pro Asd Phe 
210 215 220 



AAC GAA GCT GCT GCG TGT ACC GTC GGC GAG GTG TTC AAC GGT GAC CCA i 105 

Asn Glu Ala Ala Ala Cys Thr Val Gly Glu Val Phe Asn Gly Asp Pro 

225 230 235 

GCG TAC ACC TGC CCA TAC CAG GAA GTG CTG GAT GGC GTT CTG AAC TAT 1 1 53 

Ala syr ihr Cys Pro lyr Gin Glu Val Leu Asp Glv.Val Leu Asn Tvr 

240 245 ' 250 

CCG AT GTGAGTGATT CCGAAAGTTC CATCGATCAG GCTTTCTGAC GCATGAGAAC P08 
Pro He 
255 
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"IGURE 3 CONTINUED 

AGC TAG TAT CCT GCG CTT GAT GCA TTC AAG TCT GTC GGC GGC AAT CTC 1256 

i yr ivr Pro A"; a leu Asp Ala ?he Lvs Ser Val Gly Giv Asn Leu 

250 ' 255 ' 270 

GGC GGC TTG GCT CAG GCC ATC ACC ACC GTG CAG GAG AGC 7GC AAG GAT 1304 
Glv Gly Leu Ala Gin Ala He T hr ihr Val Gin Glu Se'" C/s Lvs Asc 
275 230 ' 235 

!wC i ^;'o C i IjvjC AA i TT{_ C> i GAG AA. i CAC GAC Ai i GCT CGC ~ T ' 1352 
Ser Asn _eu Leu Glv Asn Phe Leu Glu Asn H: s Asd He Ala Arc Phe 
290 295 300 

GCT TC GTATGGACAC TCTTTTTGAA GCCCTCATCG ATTGGGGATG CTGACACGGA 1407 
A 1 a Ser 



CAACAACAAC AG G TAC ACG GAT GAC CTT GCT CTC GCC AAG .AAT GGT CTC 1456 
Tyr ihr Asp Asp Leu Ala Leu Ala Lys Asn Gly Leu 
305 310 315 

GCT TTC ATC ATC CTC TCG GAT GGT ATT CCG ATC ATC TAC ACG GGC CAG 1504 
Ala Phe lie He Leu Ser Asp Gly lie Pro He He Tyr Thr Gly Gin 
320 325 330 

GAG CAG CAC TAC GCC GGT GAT CAC GAT CCC ACA AAT CGT GAG GCC GTC 1552 
Glu Gin His Tyr Ala Gly Asp His Asp Pro Thr Asn Arg Glu Ala Val 
335 340 345 

TGG CTG TCT GGC TAC AAT ACC GAC GCC GAG CTG TAC CAG TTC ATC AAG 1600 
Trp Leu Ser Gly Tyr Asn Thr Asp Ala Glu Leu Tyr Gin Pne He Lys 
350 355 360 

AAG GCC AAT GGC ATC CGC AAC TTG GCT ATC AGC CAG AAC CCG GAA TTC 1648 
Lys Ala Asn Gly He Arg Asn Leu Ala lie Ser Gin Asn Pro Glu Phe 
365 370 375 380 

ACC TCC TCC AAG GTGAGTACAA TAACAAACTT TTCGAAAAAT TTTTCACCGG 1700 
Thr Ser Ser Lys 



AGAAAACCTA AGATTCGGCT AACAAAACAA. AAAAAAAAAA G ACC AAG GTC ATC 1753 

Thr Lvs Val He 
385 , 

TAC CAA GAC GAT TCG ACC CTT GCC ATT AAC CGG GGC GGC GTC GTT ACT 1801 
iyr Gin Asd Asp Ser Thr Leu Ala He Asn Arg Gly Gly Val Val Thr 
390 395 400 * 

GTC CTG AGC AAT GAA GGC GCC TCC GGG GAG ACC GGG ACT GTC TCC ATT 1849 
Val Leu Ser Asn Glu Gly Ala Ser Gly Glu Thr Gly Thr val Ser He 
405 410 415 420 
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"IGURE 3 CONTINUED 

CCG GGA ACT GGC TTC GAG GCC GGC ACG GAA TTG ACT GAT GTC £7C TCC *3Q7 
Pro uly Tnr Glv Phe Glu Ala Glv Thr Glu ' °u Trr va 1 r l- 

^25 * 430 435 " 

iuC AAG ACC GT3 ACT GCG GGG GAC AGC GGG GCG GTC GAC C-TG CCC 77" " 
t/s L/5 i.nr vai ihr Ala Glv Asd Ser G*v ~". ^ vai Asd v=" P-o i " " 

WO 445 ' 40 

TCG GGC GGA CTG CCA AGC GTG CTC TAT CCC AGC TCC CAG CTG GCC "AG tog 
>er b'y b.y Le'j Pro Ser Val Leu Tyr Pro Ser Ser Gin Leu Ala l'vs 
-33 460 465 

AG" GGT CTG TGT GCG TCG GCG TGA 
^er G 1 y Leu Cvs Ala Ser Ala 
470 475 
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a-AMYLASE COO 
ScOUENCE TYP. r 
MOLECULE TYP? 



iNG SEQUENCE 
Nuc 190!; ide 
ONA 

ORIGINAL SOURCE: Solanum Tuoerosum 
ScOUENCE LENGTH ^ -70 
SEQUENCE . 



P 20 30 AO 

■■J'-v5i-*-:L o^A.TT— CAA , i., i - ACT GAG7ATCTAG 

-iJ 60 70 30 

bi .GAGGAAC GTAATTTCAA GCTGCGATCG GCTTiTTCCC 
90 100 t 10 120 

-TuAACGAGC AAA.CACAGGT TGTGGGTTCG AGTTAGCAAG 
lj0 I 4 0 150 160 

ouAtu i ATAA iuTCAAClAC AATCCATTAT GGCGCTTGAT 
1/0 180 190 200 

'jAAAGTCAGC agtctgatcc attggttgtg ATACGCAATG 
210 220 230 ?40 

GAAAGGAGAT CATATTGCAG GCATTCGACT GGGAATCTCA 
250 260 270 PRO 

i AAACATGAT TGGTGGCTAA. ATTTAGATAC GAAAGTTCCT 
290 300 310 320 

GATATTGCAA AGTCTGGTTT CACAACTGCT TGGCTGCCT" 
330 340 350 360 

CGGTGTGTCA GTCATTGGCT CCTGAAGGTT ACCTTCCACA 
3/0 380 390 400 

GAACCn i AT TCTCTCAATT CTAAATATGG TTCTGAGGAT 
4 10 420 430 440 

CTlTTAAAAG CTTTACTTAA TAAGATGAAG CAGTACAAAG 
_ „ 4 50 460 470 480 

TTAGAGCGAT GGCGGACATA GTCATTAACC ACCGTGTTGG 
490 500 510 520 

GACTACTCAA GGGCATGGTG GAATGTACAA CCGCTATGAT 
b 30 540 550 560 

GGAATTCCTA TGTCTTGGGA TGAACATGCT ATTACATCTT 
d7 ° 580 590 600 

oCACTGGTGG AAGGGGTAAC AAAAGCACTG GAGACAACTT 
610 620 630 640 

TAATGGAGTT CCAAATATAG ATCATACACA ATCCTTTGTT 
650 660 670 680 

CGGAAAGATC TCATTGACTG GATGCGGTGG CTAAGATCCT 

690 700 710 

CTGTTGGCTT CCAAGATTTT CGTTTTGATT TTGCCAAAGG 
_ _730 740 750 760 

i lATGC: iCA AAGTATGTAA AGGAATATAT CGAGGGAGCT 
77 0 780 790 800 

GAGCCAATAT TTGCAGTTGG AGAATACTGG GACACTTGCA 
810 820 830 840 

ATTACAAGGG CAGCAATTTG GATTACAACC AAGATAGTCA 
850 860 870 880 

CAGGCAAAGA A i CATCAATT GGATTGATGG CGCGGGACAA 
890 900 910 920 

CTTTCAACTG CATTCGATTT TACAACAAAA GCAGTCCTTC 
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9^0 


950 


960 




CAAAGGAGAA 


• CTGGCG i i 


i GCGTGACTC 


97 q 


980 


990 


^ 1000 


■ AAGGuoAAu 


- ^ ^ r n p ^ a 


' ; i t AGGATT 


^ : oClu I i CA 


1010 


1020 


1030 


1040 




•w ' . i A • uA 


-a i lAloAC 






1050 


1070 


1080 








i 'oTT.- 1 oG.^ 


1090 


1100 


1110 




UUU^ i A • GuA 


TACATTCTA 




GATACCATCA 


1130 


114Q 


1150 


1160 


Gill 1 C~G 


ACCA i i I CTA 


CGAATGGGA7 


AATTCCATGC 


1170 


1130 


1190 


1200 


A ^ .T' ^ *. ax 


i 'j i AAAu\- 1 u 


A7TGCTATT;" 


GGAGGAA.TCA 


1210 


n o J220 


1230 


1240 


AGGCATACAC 


AGCCG i i CA i 


CTATAAGAAT 


TCTTGAGGCA 


1250 


1260 


1270 


1280 


CAGCCAAACT 


TATACGCTGC 


AACCATTGAT 


GAAAAGGTTA 


1290 


1300 


1310 


1320 


GCGTGAAGAT 


TGGGGACGGA 


TCATGGAGCC 


CTGCTGGGAA 


1330 


1340 


1350 


1360 


AGAGTGGACT 


CTCGCGACCA 


GTGGCCATCG 


CTATGCAGTC 


1370 


1380 


1390 


1400 


TGGCAGAAGT 


AATCTTACAG 


CTATTCCGTT 


ACTTAATATA 


1410 


1420 


1430 


1440 


TTAGTAGAAA 


TATATATGTT 


TTAAACCCGA GCACCTACTT 


1450 


1460 


1470 


1480 


CTAACACTAG 


ATCCGCCTCT 


ACAGGCTTGG 


ATGGAGTGAT 


1490 


1500 


1510 


1520 


GAG i 1 1 i i i i 


TTCCTGTTCA TTAGACATTG CAACATGGGA 


1530 


1540 


1550 


1560 


i G i ATG Mil 


GTTAATAAAA GTGTTCTTGA 


TCAATGCAAT 


1570 






GTAATAAGGG 
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SEQUENCE : 



SEQUENCE T-'PE: Nut.ci 

MOLECULE TYPE DNA 

ORIGINAL SOURCE BAR! :"Y 

SEQUENCE LENGTH; 2037 

STRANDEDNE3S: D0U6L-" 

TOPOLOGY: ! INFAR 



Nucleotide ssauence of a cONA enccdina -he la roe subur.-: r.f aqP- 
V'r'r-t Dyroonosphoryl ase from parley seed endosperm •.oeplIO) 



81 

121 

161 

201 

241 

281 

321 

361 

401 



601 



801 



1001 



1201 



1401 



1601 



ACGACCACC i 

ctcctctccc 
cttctccctc 

gtccccggca 
cagttcagca 
tttccccagt 
caagatcggg 
tccaggagga 
caccggtgcg 
gccgacaccc 
acgccgatcc 
tcatactcgg 
cacaagcaca 
tgttacaggc 
acagtggcat 
ctcggcatct 



ccgaactcaa cgcctccacg gaccatctct 

lACCACCACC 



-CCCTC AC 



ggcgggggaa 
tggccgcgac 
ccgcggaaca 
cttgaggact 
tgatcttgtc 
ggagcttgtg 
actttatcat 
agtacgggct 
ccagttttct 
atgaaagtgg 
accctgctaa 
tgtcttcaag 
agatacgcag 
cgagagctct 
cactgactac 
ttcgatgcga 
ttgaatttta 
tcggtactta 
aaagaagcga 
gcaaaatcga 
aaactccgga 
gcggactcgt 
tgtctgaggg 
aaagatcagc 



cctgca 
gtcgccaccc 
gttgcaggtg 

caggagagag 
gacagcagca 
tgtgcaacgg 
cagtgcgtgc 
ttgttctccg 
gaacgaggtc 
cggcggcacc 
agggccacac 
tcatcgatat 
caacaagata 
ctcaatcgcc 

TCAATTTCAC 

acaaatgcct 
gcggatgccg 
actataagca 
gggcgatcag 
cagaaacatg 
gtgcccctgt 
agtgaagttc 
gagaagccaa 
ataccagttt 
atatccatac 
agagatgttc 
aactacatga 
gcatgatcac 
tgggaggaca 
acatggccct 
tgatccaaaa 
ccgccaacaa 

TCATTTCGCA 
GCACTCCATC 
AGCGAGCTCA 
ACGAGACCGA 
CAAGGTTCCC 
AACTGCATCA 



A i jCG i i CA I 

i LuA I CCl 



* ^ r a r* ? r r r 



GACTGCTAA 
CCTGGAGGGC 
GGATCGGCCT 
GCATCAGGCA 
CGGCGCAGGG 
TCACCTCCGA 
GACGTCCTTC 
GCGGCCGTCG 
GGGACTCAGC 
CTGCTGTTCC 
TCCCATGAGC 
TTCGTCATGA 
ACATTCACCG 
TGATGGATCT 
GGGGAGGCTG 
TCAGAAAATT 
TAAATCCATA 
CTTTATCGCA 
TGGATGACAA 
TGGAGAGAGC 
GACAGTTCAG 
AGGGCGACGA 
TCTCAATTTC 
ATTGCTTCGA 
TGCTGAACCT 
CTTTGGGTCT 
AATGTACAGG 
TTGGAACAAT 
CTGCGAACAG 
ACCCCCTTCT 
AGTCAGACAA 
CGGCTGCTTC 



ATCGGCGTTC 
AGAACGCGAT 
GGACGAGATC 
ATCGGCGTCG 
i CGACATGAA 



Ai i^AiCCGT 
TCACCCCGCC 
GTCATCGATG 
AAGGCGTGCG 
GCGAGCGCCT 
CGAGAGAGCG 
GCCCCGCCGC 
CGCCAGCCCG 
CGGAGGAATT 
GTCGCGGCCG 
TCTTCCCGCT 
TATTGGAGGA 
AACTGCTTCA 
CCCAGTTCAA 
CACCTACCTC 
GTTGAGGTAT 
CTGGATGGTT 
TATCTGGGTG 
GAGCACATTT 
TGGATTACAT 
TGCTGACATT 
CGGGCATCTG 
GCCGTGTGAT 
TCTGGAAGCG 
GCCATAGACG 
TGGGAGTTTA 
TCTAAAGTCA 
GAAATCCTCC 
CATATGTCTT 
CAGATCCTTC 
CCTCCAAAGT 
TCACTTCGCC 
GTGCAGGATC 
TTGCGTGAAT 
GTTCACGCCT 
GATGATGGGC 
TCGAGGCTGA. 
GGGAGAACAC 
CGCGAGGATA 



WO 96/294 J 6 ^ 
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.-ibURE 5 CONTINUED 

GGAAGGGACG TGGTCATCTC AAACAAGGAG GGGGTGCAAG 
AAGCCGACAG GCCGGAGGAA GGGTACTACA TCAGGTCCGG 
GATCGTGGTG ATCCAGAAGA ACGCGACCAT CAAGGACGGC 
ACCG i CGTGT AGGGCGTGCC GGGTCGGCGC GACGGGGTTr 

1301 7GCGACAACC TGTGCGCTGC GTCGGTCGTC ATCAT"TTCf 
CAAACTCCGG GACTGAAGAA GTGATCCGGG GACGGGAGAC 
a ; , i GAAGCT 7GAATGAC7G AGAC7GAAAG 7GAAGGCGCA 
5CAGAGGCAG GCAGCATTAG "AGTAAGTAG TAAGTAAGT* 
GCAGTGGAAC AAAG7AA7AG "G77CGTT7 7TCCCCTG7A 

-00i A i AAATAAGA GGCTG7G.G7 TG AGGT.AAAA .AAAAAAA. 
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SEQUENCE: 

SEQUENCE TYPE - 
MOLECULE TYPE- 
ORIGINAL SOURCE: 
SEQUENCE LENGTH 
STRANDEDNES" ■ 
TOPOLOGY: 
COMMENT . 



Nuclectice sequence of a cDNA encoding the small subumr of AD°- 

?'-,^F°f! oyropnosDncryiase rrom oar lev seed endosoerm «'beDS) 
NUClEil .-uD 
ONA 

BARLEY 



DOUBLE 
LINEAR 

■he " . ' 



zi 1569 denotes a Dunne. 



I 4AAAGTGAAC 
ATTTTATATi" 
AAGTTCCCTT 
CGTTTATAGT 
AATCCCCATG 
201 GAGGTGGTGC 
GCGTGCAAAG 
CTTATTGATA 
TATCAAAGAT 
TCTTAATCGT 
401 GGAGGTTACA 
CACAGCAGAG 
TGCAGATGCT 
CATAATGTTA 
TGTACCGAAT 
601 AGAAACGGAT 
GATGAGGAAC 
ATGAAGAAGG 
AGGAGAACAG 
CTTGGCCTTG 
801 TTGCTAGCAT 
GCTTCAGCTT 
TTCGGAAGTG 
TGAGGGTACA 
TATTGGTACA 
1001 ATTACCAAAA 
GTTCTGCTCC 
TTCAAAGGTT 
GGTGAAGGAT 
CAGTAGTTGG 
1201 AATAGAGGAC 
ACTGAAGCTG 
TTCCCATTGG 
AATCATTGAC 
ATAATCAATG 
1401 CAGATGGATA 
CAAGGATGCT 
AGATGTGAAA 
AGTCTGGAAT 
AATAAAAA.G 
1601 TTCCCCCCTT 



: CACACATCA 
SCTCGGTGAT 
GlCC i CCCC : 
CATAAGAGCT 
CTATTGATAG 
AGGGACTAGA 

CCTGCAGTGC 
TTCCTGTCAG 
CTATGTGCTT 
CATCTCTCAC 
AGAATGAAGG 



:agataac 



GTAAGGCAG . 
TGGAGTATCT 
GGACTATGAA 
GCTGATATTA 
GTGCAACTGC 
GAGGATAATT 
TTGAAAGCTA 
AAGATGCGAG 
GGGTATCTAT 
CTCCGTGAGC 
AAGTTATCCC 
AGCATACCTA 
ATTGAGGCAT 
AACCAATACC 
CATTTACACA 
CTTGATGCTG 
GTGTTATTAA 
ACTCCGTTCC 
ACGTTGCTAA 
ATAAGAAACT 
TATTGGAAAG 
AAGAATGCTC 
TTGACAATGT 
CTTCATCAAA 
TTACTCCCTA 
TGTATGCCAA 
CAACCAACAA 
GAGTGCCATG 
GATGTATTAG 



CTCAATATC7 
GGATGTACCT 
TCCAAGCATG 
CATCGAAGCA 
TGTTCTCGGT 
TTGTATCCCC 
CATTGGGTGC 
TAATTGTCTG 
ACACAGTTCA 
GAGCCTATGG 
ATTTGTTGAA 
CCTGA.CTGGT 
ACTTGTGGCT 
AATTCTTGCT 
AAGTTTATTC 
CTGTTGCTGC 
ATTTGGCCTT 
GAATTCGCAG 
TGATGGTTGA 
GGCAAAGGAA 
GTTATTAGCA 
AATTTCCTGG 
TGGTGCAACT 
TACGACGGTT 
TCTATAATGC 
TGATTTCAGT 
CAACCTCGAC 
ATGTGACAGA 
AAACTGCAAG 
TGCATATCTG 
TGGGTGCGGA 
CCTTGCTGAA 
AATTCACACA 
GTATTGGAGA 
TCAAGAAGCG 
AGTGGCATCG 
GTGGAACAGT 
AAGACAGGGC 
GGCCGCGAAG 
CGAGTCACTT 
GAACTGTGAT 



ATATCC i i CC 
i i uulA i C i A 
AACAATGCAA. 
TGCAGATCTC 
ATCATTCTTG 
TGACGAAGAA 
CAACTACAGG 
AACAGCAACA 
ACTCAGCTTC 
GAGCAACATT 
GTCCTTGCTG 
TCCAGGGTAC 
ATTCGAGGAG 
GGAGATCACC 
AGGCACACAG 
CTTGCCCATG 
ATGAAAATCG 
AGAAACCAAA 
TACGACCATA 
A7GCCTTATA 
AACATGTGAT 
AGCTAATGAC 
AGCACTGGCA 
ACTGGGAAGA 
AAATTTGGGA 
TTCTATGACC 
ACTTGCCTCC 
CAGTGTAATT 
ATACACCATT 
AAGGTGCAAT 
CTACTATGAG 
AAAGGTGGCA 
TCAAAAGAGC 
TAACGTGATG 
GCGAGGGAGA 
TAACTGTGAT 
CATATGAAGC 
TACTTGCGTC 
GAGATCATAA 
CTACACCCTT 
GTACAAGCAA 
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CTGTGATGCA CT7ACGCGAA GTGCCCCTGG ATTCAGCTTT 
CTCTTTGCTT GT.AACTGGTT TCCAGCAGAC CATGCTA7TT 
GTTGTATGGT "CGTGCAAAA CCTTGCGATG CTTTATATAT 
GCTTTAiATA 7AAACAAGA7 GAATCCCCGC GCG7TGCTGC 
2001 GGCACAAAAA AAAAAAAAAA AA 
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a-GLUCAN LYASE COOING SEQUENT 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE: QNA (GENOMIC) 
ORIGINAL SOURCE: FUNGALLY IN.---; 
SEQUENCE LENGTr 32r7 P.p 
S: RANGE ONE 5" nnup.i £ 
SEQUENCE. 

10 ?■". 



ALGAE 



c i 

i?i 

131 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
21Ci 
2161 
??2i 

2281 
2341 
2401 
2461 
2521 
2531 



b'w'JUrtbU i LA 

CGCAAAACCA 

bLuub ; AGTA 
: ubuu t I i CA 
uuCATCACTG 
ATCACTGTCC 
G i i CGC i i CA 
CAACTAGATT 
ACATCAGAAG 
TTCAAGACCA 
GGAACTGCAT 
AACGCTATCG 
TATGGTGCAG 
ATCGCCATGA 
CCGCATCATG 
TGGTTGATCG 
GACAATGTCT 
GACCTGGCAT 
GGGGGTGGGA 
AACCAAGTTC 
GGTGi i i iCG 
TCAGTCGAAG 
GTGGACGTGG 
GCAAACAGGG 
GACAAAGGCC 
CAAGACTACG 
CTGACGGGTA 
CTGGACTATG 
GTGGCCGAAT 
: GGC.AAGACA 
AAACCGGATG 
ACGTACCATC 
ATGGTCACTC 
ATCGTGGAAA 
GGTTACATTG 
AACTACATCC 
GTCGGCTCCG 
ACCGGGGACT 
CACTATGATA 
TATCCGAATG 
GTGTTGTACA 
TCGATGTACA 
GGACATGATG 



uu i C.AAA i G 
ATCGCCTCA.A 

L.-ujr".Lrt.n i >_•_ 

GCCCCCTCTC 
Ai.iGGACGGC 
AGCATCCCGT 
ACCCTGATGG 
GGATTCGAAC 
GTTTCTTGAC 
GAGTTACGAG 
CGTCCGGGAA 
CTTCCGTCAA 
GTGAAGTCAA 
CAAATTACAA 
ATGGTGCCCT 
TTAATGGATG 
CTCAATCTTA 
ACATGGGCGC 
TGGAATGTGT 
TCAACAAACG 
GGACTTCTTC 
AAATTGTAGA 
ATATGCAAGA 
TGGGTACTGG 
TTGTTTGTCA. 
AGGTCAATCA 
CGGA.TTTTGG 
GGGGTGGAGT 
GGTGGGGAAA 
TGACTGTTCC 
GGAATTGGCC 
CCCAAGTGCT 
AACGCAACAT 



TGTCGCACCt 
iCGTATCCAT 
TG i A I CCATG 
GGACGG i ATC 
CAACACGAAC 
TACAATGAAT 
TCAGGTTCAG 
CCCTATTCGT 
GCAGGAGCTG 
TGAGACC 



I 

AAAGTCTGAC 
CAAGTGCCGG 
CAAGAACTTC 
CTGTAAGTAC 
CTACGATAAC 
CAACCCAGAC 
CGCCGGTACT 
CATGAATACT 
GCAGTATGGA 
GGTCACAGCG 
TTCAGTAATG 
CTTGTTGAGA 
AGGTTATCAA 
CAACTTGCGG 
CGGGGATCCA 
GACAAATATA 
GACGTTAAGG 
AATGACCGAC 
AGAATGTGAT 
TAACTATAAG 
AGCAATGATG 
GAATGCGGAC 
TGTAACTGAT 
TCATGCGTA" 



-CCGCATTGT 
GACTACAAGA 
TGGTTTGCTG 
GTCAACTTCG 
GTCACGTCAT 
GATGTGACTC 
TCAGAGGGAT 
AAGGATCTAA 
GGCAAGGTCA 
GGATTGATGT 
CGCAACGACG 
CAGGACACCT 
TTGAACTATA 
TATTATATTC 
TCGGAGCAGT 
GGAGATACTA 
CCATTTGACC 
TTCTCTCTTC 
CCTCCGAAAT 
GCGCATATGC 
AACAACAATT 
GTGTTCACCA 
AACAACCGAT 
ACTTGCTTCC 
GAGAGGCAGT 
GACGGCCCCA 
GCAC 



.— •/-*■- -~ * . . ~ 

uAGCTb i uCA 
ClGAlAAAlA 
Cl i AC G ATT A 
CCGGCTCTC 
ACCuTA j CGA 
ACAACAACAA 
GTGGGCCTAT 
GTGATCCCGG 
GCGTCATCAT 
TCATGGAAAA 
TCGTTGATAG 
CGGTCAAGCA. 
ACATCTTAGA 
ACCAGTGGGA 
CAATGTACTA 
ACTCGTATGG 
CCTGGAATTC 
AACA 



ACGCAGACAC TCTAACGAAG 
GTAACCAGCA 
AAATGATGAT 
ACATTGGAGG 
TGATGGTGAG 
GGTGGATCGA 
AAATGGATAC 
CGGCCATGTA 
ATAACGACTC 
GATATCGCAT 



CGGGGGT 
TGCCAACAAT 
ATTCACCTCA 
GTATGTGCAG 
GTCCAAGGAC 
GTTGAGGAAG 
CCAGAATGCG 
AAACGTTCGC 
TCTGTGCGCG 



TTCC 
AAACTGTTCA 
CCGCACAAAA 
GATCCGTCCA 
ATGCGTTATG 
ACACTGTGCG 
TTCCGCCGTA 
A.TGTGGGTGG 
ATTAACATGA 
TACGACAATG 
GCGGGCTGCC 
CACGGAAAGG 
TTCGTTGAAT 
GCTTTCGGAA 
AGGGCGCAGA 
CCTGTTGTGT 



TGT 

TACAAGGCAA 
ACGTCTTTGG 
CAGCAGGTGA 
TCCCTTTCGA 
CGA AGGGC GA 
CGGTTTTTGA 
TGAGGAATGA 
TGTACACGAA 
GCGATGCGTA 
CAGACTGGGG 
GCATTGGTCT 
TTGGCGA i GA 
ATGGACAATA 
AGAATCATGG 
AGTCTACTAG 
GCTACATTAT 
GAGACAACTC 
ATATGTCTTG 
AGAATCAGC3 
TG 1 1 GCCG i G 
ACTACCAGGA 
TCCGTTATCG 
AGCCGATTAT 
ACGATCATTT 
GGGAGAATTC 



A .\<*."-T«"- 

.-.-'i.vJ'Ji. I .--L : 
r t ■ — ^ * — « 
: v^VJUMU I - 

: ■-•L'w'wCGbG I 

CAATCCGTC: 
L^uv_ i ACAGG 
CCTCAAGCAG 

AATGACTTTC 
CTACGGAAAT 
TGATGAAGTT 
ATTATACGGT 
GGAGGGATTC 
ACGCACTGGA 
CCTTAGACCT 
CGCAGCACCT 
ATGGTTCATG 
TGGACAAGAG 
TTACGGTGCT 
GGAGTTCGAG 
TTTCTTCCAG 
GAACAACATC 
GGGGCTCGCT 
ATTTTGGACC 
ATGGGCACAT 
TAACGAGGGG 
GAACGACTCC 
CATCGGTCAT 
ACGGCCTGAC 
CGA.CTTCGTC 
CATCAATGTG 
CAACTGGAAG 
TCGGGAACCG 
GAjfGGAAGGG 
CAGTCGTGGT 
TACTACATCA 
CTTGCCTCTC 
AA.CGCCGTGT 
G i ! CAGGAAC 
GCTGTACATG 

lA.muol i G w : 
CC i i CTTGGT 
GACCGAACGC 
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26*1 
2701 
2751 
2821 
2381 

3001 
3061 
3121 
318: 
3241 



GAATTGTACT 
CCTCTGGAAG 
G.AATCACCA.A 
GAAAA.CAAAT 
CTCGGAAACA 
GCTGAAGACA 
GAGACGATAA 
CGAGTGCGCG 
GACA7GAAGG 
GATTTCTGGG 
CCGGACGCTG 



TGCCCGTGCT 
bAGCGATGAA 
TCTTCGTGAG 
CATTGAACAC 
ACCGTGCCGA 
ATGGCAAGTT 
CGTTTACGAA 
GCGCTCAGTC 
JGAGCTC : 3C 
■ iGACCAGGA 
TGATCACAAT 



GACCCAATGG 

CGGAGGGGAC 

AGAAGGTGCG 

GTACACGGAC 

CGG i Aj_GTGT 

CTCTGTCGTC 

TGATTGCTAT 

GCCGTCGAAC 

CACTTCCAGG 

GACAGA. i i C i 

TACCTAA. 



TACAAATTCG 
CGAATTTACA 
aTTCTlCl ' A 
GAAGATCCGT 
7ATC7TGATG 

rvAbu I GGCAG 

GAGTACG1 r 
ATCCACGTGT 
GCTGCGC ; G T 
^ * G J GuC i uA 



GTCCCGACTT 
ACTACCCTG7 
CCCGCTACAC 
t GG i G t i i GA. 
ATGGCGGTG7 
CGGAGCAGGA 
i t oGAC'^ 
CTTCTGGAGC 
TCAATGACGG 
AG i ;GCCCAA 



TGACACCAAG 
ACCGCAAAGT 
GTTGAACGGT 
AGTATTCCCC 
GACCA.CCAAT 
i i GG : ACG 
GTTCTACGTT 
GGGTTCTCAG 
GGAGAACGGT 
i i 'o i i CTC 
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FIGURE 3 

a-GLUCAN LYASE CODING SEQUENCE 

SEQUENCE TYPE: NUCLEIC ACID 

MOLECULE TYPE; QNA (GENOMIC) 

ORIGINAL SOURCE: FUNGAL LY INFFC7ED ALGAE 

SEOUENCE LENGTH: 3276 3P 

5 i RANDtDNESS : DOUBLE 

SEQUENCE. 

10 20 30 

1 ATGTATCCAA CCCTCACCTT CGTGGCGCCT 
61 GTGGGCATTT TTAGGTCACA CATTCTTATT 
121 GTGCGCAAAA GCAACCGCC7 CAATGTATCC 
181 GuACTGGAG GGAAGGACAA CCCGGACAAT 
241 GTGTGGCGCT TCGACCCCCT CAGCAATACG 
301 GGCGAiATTG ACGACTGGAC GGCGAC.AATG 
361 iCCTTCACTC TCGAGAAACC GGTTCAGGTT 
421 AGGGTTCGCT TCAACCCTGA TGGTCCTATT 
481 CAGCAACTAA ATTGGATCCG GAAGCAGGAG 
541 i i CACAA AAG AAGGTTTCTT GAAATTTGAG 
601 AATTTTAAGA CTAGAGTTAC GAGGAAGAGG 
661 GTGCCGGCAG GATCGTTAGG GAACAAGTGC 
721 GGCACTGCCA TCGCTTCCGT TAATGAAAAT 
781 TTCTATGGTG CAGGAGAAGT AAACTGCGAG 
841 TACATCTTAG AACGAACTGG AATCGCCATG 
901 AACCAGTCAG ATCTTATTGC TCCAGGATAT 
961 TATTTTGCAG CACCTTGGGT AGTTGTTAAG 
1021 TCGTACGGAT GGTTTATGGA TAATGTCTCC 
1081 TGGAACTGTG GAGAGGAGAA CTTGGCATAC 
1141 CATTTTGTGT ATGGTGATGG AGATGGTCTT 
1201 CAAGGCAAAG AGTTTGAGAA CCAAGTTCTG 
1261 GTGTTTGGTT ACTTTCAGGG AGTCTTTGGG 
1321 GAGGGTGGTA ATAACATCTC TGTTCAAGAG 
1381 CCTTTAGAGG GGTTAGCCGT AGATGTGGAT 
1441 AAGATTGAAT TTTGGACGGC AAATAAGGTA 
1501 GTGTTTGAAT GGGCACATGA CAAAGGCCTT 
1561 AGAAACGACA ACGGCGGGGC AGATTACGAA 
1621 TACACGAAGA ATGACTCACT GACGAACACT 
1681 GATGCGTACA TTGGACATCT GGACTATGGT 
1741 GACTGGGGTC GACCGGGTGT GGCTGAATGG 
1801 ATTGGTCTGG ATTTCGTCTG GCAAGACATG 
1861 GGCGACGCAG TCGATACGAG ATCACCTTAC 
1921 GGACGATACA ATTGGAAATC TTACCATCCA 
1981 AATCATGGAA GGGAACCGAT GTTCACTCAA 
2041 TCTACGAGGA AGGAAGG6AT TGTTGCAAAT 
2101 TATATTATCA GTCGTGGAGG TTACATTGGC 
2161 GACAACTCTT CCTCCCAAAG ATACCTCCAA 
2221 ATGTCTTGCC TTCCACTAGT TGGGTCCGAC 
2281 AACGTGTGTC CCGGGGATCT AATGGTAAGA 
2341 TTCAGAAACC ACTATGGTAG GTTGGTCGAG 
2401 CTGTACATGT ACAAGGACGA GATGGCTACA 
2461 TGGCAGGAGG TGTTGTACAC TGCTATGTAC 
2521 AAGGCAGCTT CCATGTACGA CAACGACAGA 
2581 CTTCTCGGCG GACACGATGG ATATCGTATT 
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*0 50 50 

: _ i 

mG i GCGC i AG buGCCAGAAC : * i CACG i GT 
CATTCGGTTG TTCCAGCGGT GCGTCTAGCT 
ATGTCCGC i i iGTTCGACAA ACCGACTGCT 
ATCAATTACA CCACTTATGA CTACGTCCCT 
.AA.CTGGTTTG CiGCCGGATC TTCCACTCCC 
AATGTGAACT TCGACCGTAT CGACAATCCA 
CAGGTCACGT CATACAA.GAA CAAT7GTTTC 
CGCGATGTGG ATCGTGGGCC TATCCTCCAG 
CAGTCGAAGG GGTTTGATCC TAAGATGGGC 
ACCAAGGATC TGAACGTTAT CATATATGGC 
GATGGAAAAG GGATCATGGA GAATAATGAA 
CGGGGATTGA TGTTTGTCGA CAGGTTGTAC 
TACCGCAACG ATCCCGACAG GAAAGAGGGG 
TTTTGGGACT CCGAACAAAA CAGGAACAAG 
ACAAATTACA ATTATGACAA CTATAACTAC 
CCTTCCGACC CGAACTTCTA CATTCCCATG 
GGATGCAGTG GCAACAGCGA TGAACAGTAC 
CAAACTTACA TGAATACTGG TGGTACTTCC 
ATGGGAGCAC AGTGCGGTCC ATTTGACCAA 
GAGGATGTTG TCCAAGCGTT CTCTCTTCTG 
AACAAACGTG CCGTAATGCC TCCGAAATAT 
ATTGCTTCCT TGTTGAGAGA GCAAAGACCA 
ATTGTCGAAG GTTACCAAAG CAATAACTTC 
ATGCAACAAG ATTTGCGCGT GTTCACCACG 
GGCACCGGGG GAGACTCGAA TAACAAGTCG 
GTATGTCAGA CGAATGTTAC TTGCTTCTTG 
GTCAATCAGA CATTGAGGGA GAAGGGTTTG 
AACTTCGGAA CTACCAACGA CGGGCCGAGC 
GGCGGAGGGA ATTGTGATGC ACTTTTCCCA 
TGGGGTGATA ACTACAGCAA GCTCTTCAAA 
ACAGTTCCAG CTATGATGCC ACACAAAGTT 
GGCTGGCCGA ATGAGAATGA TCCTTCGAAC 
CAAGTTCTCG TAA.CTGATAT GCGATATGAG 
CGCAATATGC ATGCGTACAC ACTCTGTGAA 
GCAGACACTC TAACGAAGTT CCGCCGCAGT 
AACCAGCATT TTGGAGGAAT GTGGGTTGGA 
ATGATGATCG CGAACATCGT CAACATGAAC 
ATTGGAGGTT TTACTTCGTA TGATGGACGA 
TTCGTGCAGG CGGGTTGCTT ACTACCGTGG 
GGCAAGCAAG AGGGAAAATA CTATCAAGAA 
TTGAGAAAAT TCATTGAATT CCGTTACCGC 
CAGAATGCGG CTTTCGGGAA ACCGATTATC 
AACGTTCGCG GCGCACAGGA TGACCACTTC 
TTGTGTGCAC CTGTTGTGTG GGAGAATACA 
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FIGURE 5 CONTINUED 

2541 ACCAGTCGCG ATCTGTACTT GCCTGTGCTG 
27G1 GACACCAAGC GCCTGGATTC TGCGTTGGAT 
2761 CCACAAAGCG ACTCTCCGAT ATTTGTGAGG 
2321 ~GGACGGTT CGAACAAGTC AATGAACACG 
2831 GTATTCCCTC T7GGAAACAA CCGTGCCGAC 
2941 ACTACAGATG CTGAGGACCA 7GGCAAA7TC 
3001 3GTG77ACGA CGACGATCAA GTTTGCGTAT 
3061 77C7ACG77C GA.ATCCGTAA TCTTACGACT 
3121 GGTGAA.GAGG ACATGACACC GACCTCTGCG 
3131 GGTGTTGGAG AATAC7GGGC 7GACAA7GAT 
3241 CTGG7TC7GC AAGACGC7G7 GA7TACCA7T 



ACCAAATGGT ACAAA.7TCGG CCC7GAC7A7 
GGAGGGCAGA TGATTAAGAA C7A7TC7G7G 
GAAGGAGC7A TTCTCCC7AC CC3C7ACACG 
7ACACAGACA AAGACCCG7T GG7GTTTGAG 
GGTATG7G7T A7CTTGA7GA 7GGCGG7ATT 
T :7G7TA7CA. A7G7CGAAGC : — ACGGAAA 
GACACHATC AA7ACG7A77 J_3A7GG7CCA 
GCATCAAAAA 1 1 AACG i G i C . . C7GGAGCG 
AAC7CGAGGG CAGC7TG7T CAG7GATGGA 
ACG7C7TC7C TG7GGATGAA GTGCCAAAC 
A.CGTAG 
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a-GLUCAN 
SEQUENCE 
MOLECULE 
ORIGINAL 
SEQUENCE 
STRANGE ONE 33 
SEQUENCE: 



LYASE CODING SEQUENCE 
TYPE: NUCLEIC ACID 
lYPE: DNA (GENOMIC) 
SOURCE: FUNGUS 
LENGTH: 3201 *P 
DOUBLE 



1 0 



- ' iJU^r-.UUlH i (lit 

: AGAl : GGA 

130 

AAGTTCCCCA 

^ J.90 

G : i CAG i i CA 
250 

GACGAGTATG 
310 

AATAAATTGG 
370 

TTCTTTACCT 
^30 

GTCGGCGATG 
490 

ACCTTGACCC 
550 

GTGTCCGACA 
610 

CAACACAAGA 
670 

GTGGGGTGGG 
730 

TTTAACTTCG 
790 

GAGCCACTGT 
850 

AATATCACGG 
910 

AACTCAGGCT 
970 

GCGGATACGG 

1030 
AAGCCCAGAT 

1090 
TTGTATTCTG 

1150 
GATGTCGATG 

1210 
CCCAAAGAGA 

1270 
CCTGTTATCA 



20 
GATCC 
80 

140 

AAAA.CTGGCA 
200 

TTAGGCCGTG 
260 

GTGATGAGAA 
320 

ATACTTATAG 
380 
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1330 13*0 1350 1360 1370 1380 

AAAAAATACT i , ATCATGGA CGACAGATAT ACCGAGGGAA CAAGTGGGAA. TGCGAAGGAT 
_ 1390 _1400 _ 1410 1420 1430 IUQ 

G i ! CGGTACA TGTACiACGG . GGTGGTAAT AA.GGTTGAGG TCGATCCTAA TGATGTTAA.T 

1450 1460 1470 1480 1490 1500 

GGiCGGCCAG ACiiAAAGA CAA.CTATGAC TTCCCCGCGA ACTTCAACAG CAAACAATA" 
1510 1520 1530 1540 1550 



CCCTATCATG GTGG i GTGAG CTACGG . i A i GGGAACGGTA GTGCAGGTTT TTACCCGGA" 
1570 1580 1590 1600 1610 



i! 



CTCAACAGAA AGGAGG i ■ CG ; A i C . GGTGG GGAATGCAGT ACAAGTATCT CTTCGATATG 
1630 1640 1650 1660 1670 1680 

GGACTGGAAT TTGiGTGGCA AGACATGACT ACCCCAGCAA TCCACACATC ATATGGAGAC 
1690 1700 1710 1720 1730 ■ 74C 

ATGAAAGGGT TGCCCACCCG TCTACTCGTC ACCTCAgACT CCGTCACCAA TGCCTCTGAG 
1750 1760 1770 1780 1790 1800 

AAAAAGCTCG CAATTGAAAC TTGGGCTCTC TACTCCTACA ATCTCCACAA AGCAACTTGG 
1810 1820 1830 1840 1850 1860 

CATGGTCTTA GTCGTCTCGA ATCTCGTAAG AACAAACGAA ACTTCATCCT CGGGCGTGGA 
1870 1880 1890 1900 1910 1920 

AGTTATGCCG GAGCCTATCG TTTTGCTGGT CTCTGGACTG GGGATAATGC AAGTAACTGG 
1930 1940 1950 1960 1970 1980 

GAATTCTGGA AGATATCGGT CTCTCAAGTT CTTTCTCTGG GCCTCAATGG TGTGTGCATC 
1990 2000 2010 2020 2030 2040 

GCGGGGTCTG ATACGGGTGG TTTTGAACCC TACCGTGATG CAAATGGGGT CGAGGAGAAA 
2050 2060 2070 2080 2090 2100 

TACTGTAGCC CAGAGCTACT CATCAGGTGG TATACTGGTT CATTCCTCTT GCCGTGGCTC 
2110 2120 2130 2140 2150 2160 

AGGAACCATT ATGTCAAAAA GGACAGGAAA TGGTTCCAGG AACCATACTC GTACCCCAAG 
2170 2180 2190 2200 2210 2220 

CATCTTGAAA CCCATCCAGA ACTCGCAGAC CAAGCATGGC TCTATAAATC CGTTTTGGAG 
2230 2240 2250 2260 2270 2280 

ATCTGTAGGT ACTATGTGGA GCTTAGATAC TCCCTCATCC AACTACTTTA CGACTGCATG 
m 2290 2300 2310 2320 2330 2340 

TTTCAAAACG laGTCGACGG TATGCCAATC ACCAGATCTA TGCTCTTGAC CGATACTGAG 
2350 2360 2370 2380 2390 2400 

GATACCACCT TCTTCAACGA GAGCCAAAAG TTCCTCGACA ACCAATATAT GGCTGGTGAC 
2410 2420 2430 2440 2450 2460 

GACATTCTTG TTGCACCCAT CCTCCACAGT CGCAAAGAAA TTCCAGGCGA .AAACAGAGAT 
2470 2480 2490 2500 2510 "520 

GTCTATCTCC CTCT7TACCA CACCTGGTAC CCCTCAAAT7 TGAGACCATG GGACGATCAA 
2530 2540 2550 2560 2570 2580 

laGAGTCGCTT TGGGGAATCC TGTCGAAGGT GGTAGTGTCA TCAATTATAC TGCTAGGATT 
_ 2590 2600 2610 2620 2630 2640 

GTTGCACCCG AGGATTATAA TCTCTTCCAC AGCGTGGTAC CAGTCTACGT TAGAGAGGGT 
2650 2660 2670 2680 2690 2700 

GCCATCATCC CGCAAATCGA AGTACGCCAA TGGACTGGCC AGGGGGGAGC CAACCGCATC 
2710 2720 2730 2740 2750 2760 

AAGTTCAACA TCTACCCTGG AAAGGATAAG GAGTACTGTA CCTATCTTGA TGATGGTGTT 
2770 2780 2790 2800 2810 2820 

AGCCGTGATA GTGCGCCGGA AGACCTCCCA CAGTACAAAG AGACCCACGA ACAGTCGAAG 
2830 2840 2850 2860 2870 2880 

GTTGAAGGCG CGGAAATCGC AAAGCAGATT GGAAAGAAGA CGGGTTACAA CATCTCAGGA 
2890 2900 2910 2920 2930 2940 
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a-GLUCAN LYASE CODING SEQUENCE 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE. DNA (GENOMIC } 
ORIGINAL SOURCE: -UNGUS 
SEQUENCE LENGTH 2213 BP 
STRANDECNESS : DOUBLE 
SEQUENCE : 

.__,_;2 i3 29 „/9 _ ._ 50 _ 60 

i uuuAuu.-. i : Ai w uoACC!_ . --iL.AA.iiC 7GCAAAGCAG AGGACiACTA CGCTGCiGCC 
_ "0 _80 90 100 110 120 

AAAGGC : GGA GTGGCCC CA GAA.GA i CATT CGCTATGACC AGACCCCTCC TCAGGGTACA 

v 130 IfO ^ 150 _ 160 170 _ 130 

AAAGAi„CGA AAAGCTGGCA iGCGGTAAAC CiiCCTTTCG ATGACGGGAC TAiGiGTGTA 
190 200 210 220 230 ?*Q 

GTGCAATTCG TCAGACCCTG TGTTTGGAGG GTTAGATATG ACCCCAGTGT CAAGACTTCT 
250 260 270 280 290 300 

GATGAGTACG GCGATGAGAA TACGAGGACT ATTGTACAAG ACTACATGAC TACTCTGGTT 
310 320 330 340 350 360 

GGAAACTTGG ACATTTTCAG AGGTCTTACG TGGGTTTCTA CGTTGGAGGA TTCGGGCGAG 
370 380 390 400 410 420 

TACTACACCT TCAAGTCCGA AGTCACTGCC GTGGACGAAA. CCGAACGGAC TCGAAACAAG 
430 440 450 460 470 480 

GTCGGCGACG GCCTCAAGA i TTACCTATGG AAAAATCCCT TTCGCATCCA GGTAGTGCGT 
*90 500 510 520 530 540 

CTCTTGACCC CCCTGGTGGA CCCTTTCCCC ATTCCCAACG TAGCCAATGC CACAGCCCGT 
550 560 570 580 590 600 

GTGGCCGACA AGGTTGTTTG GCAGACGTCC CCGAAGACGT TCAGGAAAAA CTTGCATCCG 
610 620 630 640 650 660 

CAGCATAAGA TGTTGAA.GGA TACAGTTCTT GATATTATCA AGCCGGGGCA CGGAGAGTAT 
670 680 690 - 700 710 720 

GTGGGTTGGG GAGAGATGGG AGGCATCGAG TTTATGAAGG AGCCAACATT CATGAATTAT 
_730 740 7 60 760 770 780 

T7CAAC. : iG ACAATATGCA ATATCAGCAG GTCTATGCAC AAGGCGCTCT TGATAGTCGT 
„ ^^90 800 810 820 830 840 

GAGCuuTTGi ATCACTCTGA TCCCTTCTAT CTCGACGTGA ACTCCAACCC AGAGCACAAG 
850 860 870 880 890 900 

AACATTACGG CAACCTTTAT CGATAACTAC TCTCAGATTG CCATCGACTT TGGGAAGACC 
910 920 930 940 950 960 

AACTCAGGCT ACATCAAGCT GGGTACCAGG TATGGCGGTA TCGATTGTTA CGGTATCAGC 
970 _ 980 990 1000 1010 1020 

GCGGATACGG i CCCGGAGAT TGTGCGACTT TATACTGGAC TTGTTGGGCG TTCGAAGTTG 
1030 1040 1050 1060 1070 1080- 

AAGCCCAGGT ATATTCTCGG AGCCCACCAA GCTTGTTATG GATACCAGCA GGAAAGTGAC 

1090 1100 1110 1120 1130 1140 

: ( GCATGCTG TTG i i CAGCA GTACCGTGAC ACCAAGTTTC CGCTTGATGG GTTGCA7GTC 
1150 _ 1160 1170 1180 1190 1200 

GATGTCGACT ; i CAGGACAA ~7CAGAACG TTTACCACTA ACCCGATTAC GTTCCCTAAT 
1210 1220 1230 1240 1250 1260 

CCCAAAGAAA TGTTTACCAA TCT.AAGGAAC AATGGAATCA AGTGTTCCAC CAACATCACC 
1270 1280 1290 1300 1310 1320 
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uiATCAGAGA -CGCCCGAAT GGG l ACAGTA CCCTCAAT6A GGGATATGAT 

_ i 34C _ 1350 1360 1370 1380 

hAAAAG i Al : TCATCA.GGA : GACAGATAT ACCGAGGGGA CAAGTGGGGA C3CGCAAAAT 

-390 1400 1410 i4?0 1430 

b. iC-oA-Ati C • 1 1 i lACGG 3GGTGGGAAC CCGGTTGAGG TTAACCCTAA TGA1 



• J/1Q 

"TGG 

„ _ „ _ ^ _ ^ ^ _ 2 Q Q 

GCiCuuZZAG AC . i iGGAGA CAATTATGAC 77CC3TACGA ACT7CAAC7G CA^-AGK'-' 

^^.-r?- 0 . ^Al 20 _ 153 § -5^0 1550 1560 

"-'"'•■l- u.^iuiu-b ■ . .-.LuGA i A i GGGAATGGCA CTCCAGGTTA CTAC'C'GA" 

1^§2 1§?9. _ 1600 l&iO 1620 

"ATG 



1450 1460 1470 1480 1490 



TTAACAGAG AGGAGGiiCG :ATCiGG.GG GGATTGCAGT ACGAGTATCT CTTCAA' 



530 



_loo0 1660 1570 1680 



buAciAGAGi :.GTATGGCA AGA i A i GACA ACCCCAGCGA TCCATTCAT" ajaTG-a.-ac 
1590 _ 1700 _ 1710 1720 1730 ~74C 

ATb.AAA.Guui iGCCCACCCG i CTGCTCGTC ACCGCCGACT CAGTTACCAA TGCCTC'GAG 
i/D ° _ !760 i77 0 1780 1790 1800 

AAAAAGCiCG CAAi iGAAAG TTGGGCTCTT TACTCCTACA ACCTCCATAA AGCAACCTTC 
- A ,. *§i° 1820 1830 1840 1850 i860 

CACbuiCTTG GTCGTCTTGA GTCTCGTAAG AACAAACGTA ACTTCATCCT CGGACGTGGT 
1870 1880 1890 1900 1910 1920 

AGTTACGCCG GTGCCTATCG TTTTGCTGGT CTCTGGACTG GAGATAACGC AAGTACGTGG 
^ 1930 1940 1950 1960 1970 igso 

uAATTCTGGA AGATTTCGGT CTCCCAAGTT CTTTCTCTAG GTCTCAATGG TGTGTGTATA 
r „„ 2000 2010 2020 2030 ?040 

GtuuGuTub ATA.CGGGTGG TTTTGAGCCC GCACGTACTG AGATTGGGGA GGAGAAATAT 
2050 2060 2070 2080 2090 "100 

TGCAGTCCGG AGCTACTCAT CAGGTGGTAT ACTGGATCAT TCCTTTTGCC ATGGCTTAGA 
„ 2110 2120 2130 2140 • 2150 ?160 

AAClACTA.CG TCAAGAAGGA CAGGAAATGG TTCCAGGAAC catacgcgta ccccaagcat 
„ w 2i/0 2180 2190 2200 2210 22?0 

CTTbAAACCC ATCCAGAGCT CGCAGATCAA GCATGGCTTT ACAAATCTGT TCTAGAAATT 
2230 2240 2250 2260 2270 2280 

TGCAGATACT GGGTAGAGCT AAGATATTCC CTCATCCAGC TCCTTTACGA CTGCATGTTC 
2290 2300 2310 2320 2330 7340 

CAAAACGTGG TCGATGGTAT GCCACTTGCC AGATCTATGC TCTTGACCGA TACTGAGGAT 
2350 2360 2370 2380 2390 2400 

ACGACCTTCT TCAATGAGAG CCAAAAGTTC CTCGATAACC AATATATGGC TGGTGACGAC 

AT ,2410 2420 2430 2440 2450 2460 

ATubTTGTAG uACCCATCCT CCACAGCCGT AACGAGGTTC CGGGAGAGAA CAGAGATGTC 
24/0 2480 2490 2500 2510 ?520 

TATCTCCCTC TATTCCACAC CTGGTACCCC TCAAACTTGA GACCGTGGGA CGATCAGGGA 
w -i2 30 2540 2550 2560 ?570 ?S80 

uiCGlTTTAG GGAATCCTGT CGAAGGTGGC agcgttatca ACTACACTGC CAGGATTGTT, 
rrr „ 2d90 2600 2610 2620 2630 2640 

GCCw'wAGAGu aiiAiAATCT CTTCCACAAC GTGGTGCCGG TCTACATCAG AGAGGGTGCC 
ATr ^_26d0 2660 2670 2680 2690 ?700 

ATcA. :CCGC AAATTCAGGT ACGCCAGTGG ATTGGCGAAG GAGGGCCTAA TCCCATCAAG 
^2^10 2720 2730 2740 2750 ?760 

TTlh*iA,CT A.CCCTGGAAA GGACAAGGAG TATGTGACGT ACCTTGATGA TGGTGTTAGC 
T - 770 2780 2790 2800 2810 2820 

CGcuAi A.GTG CACCAGATGA CCTCCCGCAG TACCGCGAGG CCTATGAGCA AGCGAAGGTC 
2830 2840 2850 2860 2870 2880 
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